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Abstract. In this paper wc introduce the olog, or ontology log, a category- 
theoretic model for knowledge representation (KR). Grounded in formal math- 
ematics, ologs can be rigorously formulated and cross-compared in ways that 
other KR models (such as semantic networks) cannot. An olog is similar to 
a relational database schema; in fact an olog can serve as a data repository 
if desired. Unlike database schemas, which are generally difficult to create or 
modify, ologs are designed to be user-friendly enough that authoring or re- 
configuring an olog is a matter of course rather than a difficult chore. It is 
hoped that learning to author ologs is much simpler than learning a database 
definition language, despite their similarity. We describe ologs carefully and 
illustrate with many examples. As an application we show that any primitive 
recursive function can be described by an olog. We also show that ologs can 
be aligned or connected together into a larger network using functors. The 
various methods of information flow and institutions can then be used to in- 
tegrate local and global world- views. We finish by providing several different 
avenues for future research. 
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1. Introduction 

Scientists have a pressing need to organize their experiments, their data, their 
results, and their conclusions into a framework such that this work is reusable, 
transferable, and comparable with the work of other scientists. In this paper, 
we will discuss the "ontology log" or olog as a possibility for such a framework. 
Ontology is the study of what something is, i.e the nature of a given subject, and 
ologs are designed to record the results of such a study. The structure of ologs is 
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based on a branch of mathematics caUed category theory. An olog is roughly a 
category that models a given real-world situation. 

The main advantages of authoring an olog rather than writing a prose description 
of a subject are that 

• an olog gives a precise formulation of a conceptual world-view, 

• an olog can be formulaically converted into a database schema, 

• an olog can be extended as new information is obtained, 

• an olog written by one author can be easily and precisely referenced by 
others, 

• an olog can be input into a computer and "meaningfully stored", and 

• different ologs can be compared by functors, which in turn generate auto- 
matic terminology translation systems. 

The main disadvantage to using ologs over prose, aside from taking more space on 
the page, is that writing a good olog demands a clarity of thought that ordinary 
writing or conversation can more easily elide. However, the contemplation required 
to write a good olog about a subject may have unexpected benefits as well. 

A category is a mathematical structure that appears much like a directed graph: 
it consists of objects (often drawn as nodes or dots, but here drawn as boxes) and 
arrows between them. The feature of categories that distinguishes them from graphs 
is the ability to declare an equivalence relation on the set of paths. A functor is a 
mapping from one category to another that preserves the structure (i.e. the nodes, 
the arrows, and the equivalences). If one views a category as a kind of language (as 
we shall in this paper) then a functor would act as a kind of translating dictionary 
between languages. There are many good references on category theory, including 
[LS| , |Sic| , |Pie| , |BWlj , |Awo| , and [Mac] ; the first and second are suited for general 
audiences, the third and fourth are suited for computer scientists, and the fifth and 
sixth are suited for mathematicians (in each class the first reference is easier than 
the second). 

A basic olog, defined in Section [21 is a category in which the objects and ar- 
rows have been labeled by English-language phrases that indicate their intended 
meaning. The objects represent types of things, the arrows represent functional 
relationships (also known as aspects, attributes, or obscrvablcs) , and the commu- 
tative diagrams represent facts. Here is a simple olog about an amino acid called 
arginine f[W]): 



(1) 



D 



an amino acid 
found in dairy 



arginme 



an electrically- 
charged side 
chain 



X 



R 



an amino acid 



a side chain 



has 




N 



an amine group 



a carboxylic acid 
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The idea of representing information in a graph is not new. For example the 
Resource Descriptive Framework (RDF) is a system for doing just that [CM| . The 
key difference between a category and a graph is the consideration of paths, and 
that two paths from A to B may be declared identical in a category (see [Spi3| ). 
For example, we can further declare that in Diagram ([T]), the diagram 

(2) A 

X 

commutes, i.e. that the two paths Az ^R are equivalent, which can be translated 

as follows. Let A be a molecule of arginine. On the one hand A, being an amino 
acid, has a side chain; on the other hand A has an electrically-charged side-chain, 
which is of course a side chain. We seem to have associated two side-chains to A, 
but in fact they both refer to the same physical thing, the same side-chain. Thus, 
the two paths A ^ R are deemed equivalent. The fact that this equivalence may 
seem trivial is not an indictment of the category idea but instead reinforces its 
importance — we must be able to indicate obvious facts within a given situation 
because what is obvious is the most essential. 

While many situations can be modeled using basic ologs (categories), we often 
need to encode more structure. For this we will need so-called sketches. An olog 
will be defined as a finite limit, finite colimit sketch (see }BW2| ). meaning we have 
the abihty to encode objects ("types"), arrows ("aspects"), commutative diagrams 
("facts"), as well as finite limits ("layouts") and finite colimits ("groupings"). 

Throughout this paper, whenever we refer to "the author" of an olog we am 
referring to the fictitious person who created it. We will refer to ourselves, David 
Spivak and Robert Kent, as "we" so as not to confuse things. 

Warning 1.0.1. The author of an olog has a world- view, some fragment of which is 
captured in the olog. When person A examines the olog of person B, person A may 
or may not "agree with it." For example, person B may have the following olog 



a marriage 




a man a woman 



which associates to each marriage a man and a woman. Person A may take the 
position that some marriages involve two men or two women, and thus see B's olog 
as "wrong." Such disputes arc not "problems" with either A's olog or B's olog, they 
are discrepancies between world-views. Hence, throughout this paper, a reader R 
may see a displayed olog and notice a discrepancy between R's world-view and our 
own, but R should not worry that this is a problem. This is not to say that ologs 
need not follow rules, but instead that the rules are enforced to ensure that an olog 
is structurally sound, rather than that it "correctly reflects reality," whatever that 
may mean. 

1.1. Plan of this paper. In this paper, we will define ologs and give several 
examples. We will state some rules of "good practice" which help one to author ologs 
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that are meaningful to others and easily extendable. We will begin in Section [2] by 
laying out the basics: types as objects, aspects as arrows, and facts as commutative 
diagrams. In Section[31 we will explain how to attach "instance" data to an olog and 
hence realize ologs as database schemas. In Section 21 we will discuss meaningful 
constraints betweeen ologs that allow us to develop a highcr-dimcnsional web of 
information called an information system, and we will discuss how the various 
parts of such a system interact via information channels. In Sections [5] and El we 
will extend the olog definition language to include "layouts" and "groupings", which 
make for more expressive ologs; we will also describe two applications, one which 
explicates the computation of the factorial function, and the other which defines a 
notion from pure mathematics (that of pscudo- metric spaces). Finally, in Section 
d we will discuss some possible directions for future research. 

For the remainder of the present section, we will explain how ologs relate to 
existing ideas in the field of knowledge representation. 

1.2. The semantic advantage of ologs: modularity. The difference between 
ologs and prose is modularity: small conceptual pieces can form large ideas, and 
these pieces work best when they are reusable. The same phenomenon is true 
throughout computer science and mathematics. In programming languages, mod- 
ularity brings not only vast efficiency to the writing of programs but enables an 
"abstraction barrier" that keeps the ideas clean. In mathematics, the most powerful 
results arc often simple lemmas that are reusable in a wide variety of circumstances. 

Web pages that consist of prose writing are often referred to as information silos. 
The idea is that a silo is a "big tube of stuff" which is not organized in any real 
way. Links between web pages provide some structure, but such a link does not 
carry with it a precise method to correlate the information within the two pages. 
Similarly in science, one author may reference another paper, but such a reference 
carries very little structure — it just points to a silo. 

Ologs can be connected with links which are much richer than the link between 
two silos could possibly be. Individual concepts and connections within one olog 
can be "functorially aligned" with concepts and connections in another. A functor 
creates a precise connection between the work of one author and the work of another 
so that the precise nature of the comparison is not left to the reader's imagination 
but explicitly specified. The ability to incorporate mathematical precision into the 
sharing of ideas is a central feature of ologs. 

1.3. Relation to other models. There are many languages for knowledge repre- 
sentation (KR). For example, there are database languages such as SQL, ontology 
languages such as RDF and OWL, the language of Semantic Nets, and others (see 
[Bor] ) . One may ask what makes the olog concept different or better than the 
others. 

The first response is that ologs are closely related to the above ideas. Indeed, all 
of these KR models can be "categorified" (i.e. phrased in the language of category 
theory) and related by functors, so that many of the ideas align and can be trans- 
ferred between the different systems. In fact, as we will make clear in Section O 
ologs are almost identical to the categorical model of databases presented in |Spi2| . 

However, ologs have advantages over many existing KR models. The first ad- 
vantage arises from the notion of commutative diagrams (which allow us to equate 
different paths through the domain, see Section 12. 3p and of limits and colimits 
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(which allow us to lay out and group things, see Sections [5] and E]) . The additional 
expressivity of ologs give them a certain semantic clarity and interoperability that 
cannot be achieved with graphs and networks in the usual sense. The second ad- 
vantage arises from the notion of olog morphisms, which allow the definition of 
meaningful constraints between ologs. With this in hand, we can integrate a set 
of similar ologs into a single information system, and go on to define information 
fusion. This will be discussed further Section [H 

In the remainder of this section we will provide a few more details on the relation- 
ship between ologs and each of the above KR models: databases, RDF/OWL, and 
semantic nets. The reader who does not know or care much about other systems 
of knowledge representation can skip to Section 11.41 

1.3.1. Ologs and Databases. A database is a system of tables, each table of which 
consists of a header of columns and a set of rows. A table represents a type of thing 
T, each column represents an attribute of T, and each row represents an example 
of T. An attribute is itself a "type of thing", so each column of a table points to 
another table. 

The relationship between ologs and databases is that every box B in an olog 
represents a type of thing and every arrow B ^ X emanating from B represents 
an attribute of B (whose results arc of type X). Thus the boxes and arrows in an 
olog correspond to tables and their columns in a database. The rows of each table 
in a database will correspond to "instances" of each type in an olog. Again, this 
will be made more clear in Section [3] or one can see |Spi2| or [Ken5| . 

The point is that every olog can serve as a database schema, and the schemas 
represented by ologs range from simple (just objects and arrows) to complex (in- 
cluding commutative diagrams, products, sums, etc.). However, whereas database 
schemas are often prescriptive ("you must put your data into this format!"), ologs 
are usually descriptive ("this is how I see things"). One can think of an olog as 
an interface between people and databases: an olog is human readable, but it is 
also easily converted to a database schema upon which powerful applications can 
be put to work. Of course, if one is to use an olog as a database schema, it will 
become prescriptive. However, since the intention of each object and arrow is well- 
documented (as its label), schema evolution would be straightforward. Moreover, 
the categorical structure of ologs allows for functorial data migration by which one 
can transfer the instance data from an older schema to the current one (see |Spi2| ). 

1.3.2. Ologs and RDF / OWL. In |Spi2| , the first author explained how a categor- 
ical database can be converted into an RDF triple store using the Grothendieck 
construction. The main difference between a categorical database schema (or an 
olog) and an RDF schema is that one cannot specify commutativity in an RDF 
schema. Thus one cannot express things like "the woman parent of a person x is 
the mother of x." Without this expressivity, it is hard to enforce much rigor, and 
thus RDF data tends to be too loose for many applications. 

OWL schemas, on the other hand, can express many more constraints on classes 
and properties. We have not yet explored the connection, nor compared the expres- 
sive power, of ologs and OWL. However, they are significantly different systems, 
most obviously in that OWL relies on logic where ologs rely on category theory. 

1.3.3. Semantic Nets. On the surface, ologs look the most like semantic networks, 
or concept webs, but there are important differences between the two notions. First, 
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arrows in a semantic network need not indicate functions; they can be relations. So 
there could be an arrow '"a father"' '"a child^ in a semantic network, but not in 
an olog (see Section 12.2.31 for how the same idea is expressible in an olog) . There is 
a nice category of sets and relations, often denoted Rel, but this category is harder 
to reason about than is the ordinary category of sets and functions (often denoted 
Set). Thus, as mentioned above, semantic networks are categorifiable (using Rel), 
but this underlying formalism does not appear to play a part in the study or use 
of semantic networks. However, some attempt to integrate category theory and 
neural nets has been made, see jHC| . 

Moreover, commutative diagrams and other expressive abilities held by ologs are 
not generally part of the semantic network concept (see [Sowl] ). For these reasons, 
semantic networks tend to be brittle: minor changes can have devastating effects. 
For example, if two semantic networks are somehow synced up and then one is 
changed, the linkage must be revised or may be altogether broken. Such a disaster 
is often avoided if one uses categories: because different paths can be equivalent, 
one can simply add new ideas (types and aspects) without changing the semantic 
meaning of what was already there. As section 14.41 demonstates with an extended 
example, conceptual graphs, which are a popular formalism for semantics nets, can 
be linearized to ologs, thereby gaining in precision and expressibility. 
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2. Types, aspects, and facts 

In this section we will explain basic ologs, which involve types, aspects, and facts. 
A basic olog is a category in which each object and arrow has been labeled by text; 
throughout this paper we will assume that text to be written in English. 

The purpose of this section is to show how one can convert a real-world situation 
into an olog. It is probably impossible to explain this process precisely in words. 
Instead, we will explain mainly by example. We will give "rules of good practice" 
that lead to good ologs. While these rules are not strictly necessary, they help to 
ensure that the olog is properly formulated. As the Dalai Lama says, "Learn the 
rules so you know how to break them properly." 
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2.1. Types. A type is an abstract concept, a distinction the author has made. We 
represent each type as a box containing a singular indefinite noun phrase. Each of 
the following four boxes is a type: 



(3) 



a man 



an automobile 



a pair (a,w), where w is 
a woman and a is an au- 
tomobile 



a pair (a, w) where w is 
a woman and a is a blue 
automobile owned by w 



Each of the four boxes in ^ represents a type of thing, a whole class of things, 
and the label on that box is what one should call each example of that class. Thus 
'~a man"' does not represent a single man, but the set of men, each example of 
which is called "a man'Q. Similarly, the bottom right-hand box in represents 
an abstract type of thing, which probably has more than a million examples, but 
the label on the box indicates a common name for each such example. 

Typographical problems emerge when writing a text-box in a line of text, e.g. 
the text-box | a man | seems out of place here, and the more in-line text-boxes one 
has in a given paragraph, the worse it gets. To remedy this, we will denote types 
which occur in a line of text with corner-symbols, e.g. we will write '"a man"' instead 
of I a man | . 

2.1.1. Types with compound structures. Many types have compound structures; i.e. 
they are composed of smaller units. Examples include 

(4) 



a man and 
a woman 



a food / and a child c 
such that c ate all of / 



a triple (j>, a,j) where p is 
a paper, a is an author of 
p, and j is a journal in 
which p was published 



It is good practice to declare the variables in a "compound type", as we did in the 
last two cases of ([4]). In other words, it is preferable to replace the first box above 
with something like 



a man m and 
a woman w 



or 



a pair (m, w) 
where m is a man 
and w is a woman 



so that the variables (m, w) are clear. 

Rules of good practice 2.1.1. A type is presented as a text box. The text in that 
box should 

(i) begin with the word "a" or "an"; 

(ii) refer to a distinction made and recognizable by the author; 

(iii) refer to a distinction for which instances can be documented; 

(iv) not end in a punctuation mark; 



^In other words, types in ologs are intentional, rather than extensional — the label on a type 
describes its intention. The extension of a type will be captured by instance data; see Section |3]. 
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(v) declare all variables in a compound structure. 

The first, second, and third rules ensure that the class of things represented 
by each box appears to the author as a well-defined set; see Section [3] for more 
details. The fourth and fifth rules encourage good "readability" of arrows, as will 
be discussed next in Section [^T^ 

We will not always follow the rules of good practice throughout this document. 
We think of these rules being followed "in the background" but that we have "nick- 
named" various boxes. So '"Steve"' may stand as a nickname for '"a thing classified 
as Steve"' and '"arginine"' as a nickname for '"a molecule of arginine"'. 

2.2. Aspects. An aspect of a thing a; is a way of viewing it, a particular way in 
which X can be regarded or measured. For example, a woman can be regarded as 
a person; hence "being a person" is an aspect of a woman. A man has a height 
(say, taken in inches), so "having a height (in inches)" is an aspect of a man. In 
an olog, an aspect of A is represented by an arrow A ^ B, where B is the set of 
possible "answers" or results of the measurement. For example when observing the 
height of a man, the set of possible results is the set of integers, or perhaps the set 
of integers between 20 and 120. 



a woman 


IS 


a person 





, has as height (in inches) 



(6) I a man | ^ an integer between 20 and 120 



We will formalize the notion of aspect by saying that aspects are functional 
relationshipsQ Suppose we wish to say that a thing classified as X has an aspect 
/ whose result set is Y. This means there is a functional relationship called / 
between X and Y, which can be denoted f: X ^ Y. We call X the domain of 
definition for the aspect /, and we call Y the set of result values for /. For example, 
a man has a height in inches whose result is an integer, and we could denote this 
by /i: M — )> Int. Here, AI is the domain of definition for height and Int is the set 
of result values. 

A set may always be drawn as a blob with dots in it. If X and Y are two sets, 
then a a function from X to Y, denoted f : X Y can be presented by drawing 
arrows from dots in blob X to dots in blob Y. There are two rules: 

(i) each arrow must emanate from a dot in X and point to a dot in Y; 

(ii) each dot in X must have precisely one arrow emanating from it. 

Given an element x € X, the arrow emanating from it points to some clement 
y (zY, which we call the image of x under f and denote f{x) — y. 

Again, in an olog, an aspect of a thing X is drawn as a labeled arrow point- 
ing from X to a "set of result values." Let us concentrate briefly on the arrow in 
([S]). The domain of deflnition is the set of women (a set with perhaps 3 billion 
elements); the set of result values is the set of persons (a set with perhaps 6 billion 
elements). We can imagine drawing an arrow from each dot in the "woman" set to 
a unique dot in the "person" set. No woman points to two different people, nor to 
zero people — each woman is exactly one person — so the rules for a functional 
relationship are satisfied. Let us now concentrate briefly on the arrow in ([5]). The 



^In type theory, what we here call aspects are called functions. Since our types are not fixed 
sets (see Section |3}, we preferred a term that was less formal. 
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domain of definition is the set of men, the set of result values is the set of integers 
{20, 21, 22, . . . , 119, 120}. We can imagine drawing an arrow from each dot in the 
"man" set to a single dot in the "integer" set. No man points to two different 
heights, nor can a man have no height: each man has exactly one height. Note 
however that two different men can point to the same height. 

2.2.1. Invalid aspects. We tried above to clarify what it is that makes an aspect 
"valid", namely that it must be a "functional relationship." In this subsection we 
will present two arrows which on their face may appear to be aspects, but which 
on closer inspection are not functional (and hence are not valid as aspects). 
Consider the following two arrows: 



(7*) 
(8*) 



a person 


has 


a child 





a mechanical pencil 



a piece of lead 



A person may have no children or may have more than one child, so the first arrow 
is invalid: it is not functional because it does not satisfy rule (2) above. Similarly, 
if we drew an arrow from each mechanical pencil to each piece of lead it uses, it 
would not satisfy rule (2) above. Thus neither of these is a valid aspect. 

Of course, in keeping with Warning 11.0.11 the above arrows may not be wrong 
but simply reflect that the author has a strange world- view or a strange vocabulary. 
Maybe the author believes that every mechanical pencil uses exactly one piece of 
lead. If this is so, then '"a mechanical penciP piece of lead"' is indeed a 

valid aspect! Similarly, suppose the author meant to say that each person was once 
a child, or that a person has an inner child. Since every person has one and only one 
inner child (according to the author), the map '"a person"' ^'^ chiid^ child"' 
is a valid aspect. We cannot fault the author for such a view, but note that we 
have changed the name of the label to make its intention more explicit. 

2.2.2. Reading aspects and paths as English phrases. Each arrow (aspect) X 
can be read by first reading the label on its source box (domain of definition) X, 
then the label on the arrow /, and finally the label on its target box (set of result 
values) Y . For example, the arrow 



(9) 



a book 



has ; 



first author 



a person 



is read "a book has as first author a person", a valid English sentence. 

Sometimes the label on an arrow can be shortened or dropped altogether if it 
is obvious from context. We will discuss this more in Section 12.31 but here is a 
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common example from the way we write ologs 
(10) 

I a pair {x,y) where 
X and y are integers 





an integer 



an integer 



Neither arrow is readable by the protocol given above (e.g. "a pair {x, y) where x 
and y are integers x an integer" is not an English sentence), and yet it is obvious 
what each map means. For example, given the pair (8, 11) which belongs in box A, 
application of arrow x would yield 8 in box B. The label x can be thought of as a 
nickname for the full name "yields, via the value of a;," and similarly for y. We do 
not generally use the full name for fear that the olog would become cluttered with 
text. 

One can also read paths through an olog by inserting the word "which" after 
each intermediate box. For example the following olog has two paths of length 3 
(counting arrows in a chain): 

(11) 



a child 


IS 


a person 





has as parents 



a pair [w, m) 
where w is a 
woman and m 



a woman 



has, as birthday 



a date 


includes 


a year 





The top path is read "a child is a person, which has as parents a pair [w, m) where 
w is a woman and m is a man, which yields, via the value of w, a woman." The 
reader should read and understand the content of the bottom path. 

2.2.3. Converting non- functional relationships to aspects. There are many rela- 
tionships that arc not functional, and these cannot be considered aspects. Of- 
ten the word "has" indicates a relationship — sometimes it is functional as in 
•"a person"' > '"a stomach^, and sometimes it is not, as in '"a father^ 
•"a child"'. (Obviously, a father may have more than one child.) A quick fix would 

be to replace the latter by '"a father^ '"a set of children"'. This is ok, but the 
relationship between '"a child"' and '"a set of children"' then becomes an issue to deal 
with later. There is another way to indicate such "non-functional" relationships. 

In mathematics, a relation between sets Ai, A2, and so on through An is defined 
to be a subset of the Cartesian product 

RC Ai X A2 X ■■ ■ X An- 
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The set R represents those sequences (ai, 02, ... , a„) that are so-related. In an clog, 
we represent this as follows 




For example, 



a sequence {p, a, j) where p 
is a paper, a is an author 
of p, and j is a journal in 
which p was published 



Ai 



a paper 



an author 



a journal 



Whereas Ai x A2 x A3 includes all possible triples {p, a, j) where a is a person, p 
is a paper, and j is a journal, it is obvious that not all such triples are found in R. 
Thus R represents a proper subset of Ai x A2 x A3. 

Rules of good practice 2.2.1. An aspect is presented as a labeled arrow, pointing 
from a source box to a target box. The arrow text should 

(i) begin with a verb; 

(ii) yield an English sentence, when the source-box text followed by the arrow 
text followed by the target-box text is read; 

(iii) refer to a functional dependence: each instance of the source type should give 
rise to a specific instance of the target type; 

2.3. Facts. In this section wc will discuss facts and their relationship to "path 
equivalences." It is such path equivalences, which exist in categories but do not 
exist in graphs, that make category theory so powerful. See |Spi3| for details. 
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Given an olog, the author may want to declare that two paths are equivalent. 
For example consider the two paths from A to C in the olog 



(12) 



a person 



has as parents 



a pair {w, m) 
where w is a 
woman and 
m is a man 



has 



as mother 



a woman 



We know as English speakers that a woman parent is called a mother, so these two 
paths A ^ C should be equivalent. A more mathematical way to say this is that 
the triangle in Olog p2p commutes. 

A commutative diagram is a graph with some declared path equivalences. In 
the example above we concisely say "a woman parent is equivalent to a mother." 
We declare this by defining the diagonal map in p2p to be the composition of the 
horizontal map and the vertical map. 

We generally prefer to indicate a commutative diagram by drawing a check-mark, 
v^, in the region bounded by the two paths, as in Olog (|12p . Sometimes, however, 
one cannot do this unambiguously on the 2-dimensional page. In such a case we 
will indicate the commutative diagrams (fact) by writing an equation. For example 
to say that the diagram 

/ 




commutes, we could either draw a checkmark inside the square or write the equation 
f;g = h;i above it. Either way, it means that "/ then g" is equivalent to "h then 



2.3.1. More complex facts. Recording real-world facts in an olog can require some 
creativity. Whereas a fact like "the brother of ones father is ones uncle" is recorded 
as a simple commutative diagram, others are not so simple. We will try to show 
the range of expressivity of commutative diagrams in the following two examples. 
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Example 2.3.2. How would one record a fact like "a truck weighs more than a car"? 
We suggest something like this: 




a pair {x,y) where 
X and y are phys- 
ical objects and x 
weiglis more than y 



where both top and bottom commute. This olog exemplifies the fact that simple 
sentences sometimes contain large amounts of information. While the long map 
may seem to suffice to convey the idea "a truck weighs more than a car," the path 
equivalences (declared by check-marks) serve to ground the idea in more basic types. 
These other types tend to be useful for other purposes, both within the olog and 
when connecting it to others. 

2.3.3. Specific facts at the olog level. Another fact one might wish to record is that 
"John Doe's weight is 150 lbs." This is established by declaring that the following 
diagram commutes: 



(13) 



John Doc 



has as weight (in pounds) 



150 



a person 



has as weight (in pounds) 



a real number 



If one only had the top line, it would be less obvious how to connect its information 
with that of other ologs. (See Section [3] for more on connecting different ologs). 

Note that the top line in Diagram (|13p might also be considered as existing at 
the "data level" rather than at the "olog level." In other words, one could see John 
Doe as an "instance" of '"a person"', rather than as a type in and of itself, and 
similarly see 150 as an instance of ^a real number^. This idea of an olog having a 
"data level" is the subject of the Section [3] 

Rules of good practice 2.3.4. A fact is the declaration that two paths (having the 
same source and target) in an olog are equivalent. Such a fact is either presented 
as a checkmark between the two paths (if such a check-mark is unambiguous) or 
by an equation. Every such equivalence should be declared; i.e. no fact should be 
considered too obvious to declare. 
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3. Instances 



The reader at this point hopefully sees an olog as a kind of "concept map," and 
it is one, albeit a concept map with a formal structure (implicitly coming from 
category theory) and specific rules of good practice. In this section we will show 
that one can also load an olog with data. Each type can be assigned a set of 
instances, each aspect will map the instances of one type to instances of the other, 
and each fact will equate two such mappings. We give examples of these ideas in 
Section [231 

In Sect ion we will show that in fact every olog can also serve as the layout for 
a database. In other words, given an olog one can immediately generate a database 
schema^ i.e. a system of tables, in any reasonable data definition language such 
as that of SQL. The tables in this database will be in one-to-one correspondence 
with the types in the olog. The columns of a given table will be the aspects of 
the corresponding type, i.e. the arrows whose source is that type. Commutative 
diagrams in the olog will give constraints on the data. 

In fact, this idea is the basic thesis in |Spi2| , even though the word olog does 
not appear in that paper. There it was explained that a category C naturally can 
be viewed as a database schema and that a functor I: C ^ Set, where Set is the 
category of sets, is a database state. Since an olog is a drawing of a category, it is 
also a drawing of a database schema. The current section is about the "states" of 
an olog, i.e. the kinds of data that can be captured by it. 

3.1. Instances of types, aspects, and facts. Recall from Section [2] that basic 
ologs consist of types, displayed as boxes; aspects, displayed as arrows; and facts, 
displayed as equations or check-marks. In this section we discuss the instances of 
these three basic constructions. The rules of good practice (|2.1.11 E".2.11 and l2.3.4|) 
were specifically designed to simplify the process of finding instances. 

3.1.1. Instances of types. According to Rules 1^.1.11 each box in an olog contains 
text which should refer to a distinction made and recognizable by the author 
for which instances can be documented. For example if my olog contains a 
box 



then I must have some concept of when this situation occurs. Every time I witness 
a new person-cat petting, I document it. Whether this is done in my mind, in 
a ledger notebook, or on a computer does not matter; however using a computer 
would probably be the most self-explanatory. Imagine a computer program in which 
one can create ologs. Clicking a text box in an olog results in it "opening up" to 
show a list of documented instances of that type. If one is reading the CBS news 
olog and clicks on the box '"an episode of 60 Minutes"', he or she should see a list of 
all episodes of the TV show "60 Minutes." If we wish to document a new person-cat 
petting incident we click on the box in p4p and add this new instance. 

3.1.2. Instances of aspects. According to Rules [^2.11 each arrow in an olog should 
be labeled with text that refers to a functional relationship between the source box 
and the target box. A functional relationship f : A B between finite sets A and 
B can always be written as a 2-column table: the first column is filled with the 



(14) 



a pair (p, c) where p 
is a person, c is a cat, 
and p has petted c 
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instances of type A and the second column is filled with their /-values, which are 
instances of type B. 

For example, consider the aspect 

/i r-\ I 1 orbits 

(15) I a moon | 



a planet 



We can document some instances of this relationship using the following table: 



(16) 



orbits 


a moon 


a planet 


The Moon 


Earth 


Phobos 


Mars 


Dcimos 


Mars 


Ganymede 


Jupiter 


Titan 


Saturn 



Clearly, this table of instances can be updated as more moons are discovered by 
the author (be it by telescope, conversation, or research). 

The correspondence between aspect and Table makes it clear that ologs 
can serve to hold data which exemplifies the author's world- view. In Section 13.21 
we will show that ologs (which have many aspects and facts) can serve as bona fide 
database schemas. 

3.1.3. Instances of facts. Recall the following olog: 



a person 



has as parents 



a pair {w, m) 
where w is a 
woman and 
m is a man 



has 



as inothc: 



a woman 



and consider the following instances of the three aspects in it: 



(17) 



has as parents 


a person 


a pair (w, m) ... 


Cain 


(Eve, Adam) 


Abel 


(Eve, Adam) 


Chelsey 


(Hillary, Bill) 



w 


a pair (w, to) ... 


a woman 


(Eve, Adam) 


Eve 


(Hillary, BiU) 


Hillary 


(Margaret, Samuel) 


Margaret 


(Emily, Kris) 


Emily 



has as mother 


a person 


a woman 


Cain 


Eve 


Abel 


Eve 


Chelsey 


Hillary 



When we declare that the diagram in commutes (using the check- mark), 
we are saying that for every instance of '"a person^ (of which we have three: Cain, 
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Abel, and Chelsey), the two paths to '~a woman'' give the same answers. Indeed, 
for Cain the two paths are: 

(i) Cain (Eve, Adam) M> Eve; 

(ii) Cain Eve; 

and these answers agree. If one changed any instance of the word "Eve" to the word 
"Steve" in one of the tables in pT|) . some pair of paths would fail to agree. Thus 
the "fact" that the diagram in commutes ensures that there is some internal 
consistency between the meaning of parents and the meaning of mother, and this 
consistency must be born out at the instance level. 
All of this will be formalized in Section r3. 2. 21 

3.2. The relationship between ologs and databases. Recall from Section [5.1.1l 

that we can imagine creating an olog on a computer. The user creates boxes, arrows, 
and compositions, hence creating a category C. Each text-box x in the olog can 
be "clicked" by the computer mouse, an action which allows the user to "view the 
contents" of x. The result will be a set of things, which we might call I{x) € Set, 
whose elements are things of type x. So clicking on the box '"a man"' one sees 
/(•"a man"'), the set of everything the author has documented as being a man. For 
each aspect f:x—^yoix, the user can see a function from the set I{x) to I{y), 
perhaps as a 2-column table as in (|17|) . 

The type x may have many aspects, which we can put together into a single 
multi-column table. Its columns are the aspects of x, and its rows are the elements 
of I{x). Consider the following olog, taken from [Spi2| where it was presented as a 
database schema. 



manager 




string 



The type '"Employee"' has four aspects, namely manager (valued in ^Employee"'), 
works in (valued in '"department"'), and first name and last name (valued in 
'"string"'). As a database, each type together with its aspects form a multi-column 
table, as in the following example. 

Example 3.2.1. We can convert Olog (|18p into a database schema. Each box rep- 
resents a table, each arrow out of a box represents a column of that table. Here is 
an example state of that database. 
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(19) 



employee 


Id 


first name 


last name 


manager 


works in 


101 


David 


Hilbcrt 


103 


qlO 


102 


Bcrtrand 


Russell 


102 


x02 


103 


Alan 


Turing 


103 


qlO 






department 




Id 


name 


secretary 


qlO 


Sales 


101 


x02 


Production 


102 



string 
Id 



ab 



Note that every arrow f : x y oi Olog (fT5|) is represented in Database (fTO|) as a 
column of table x, and that every cell in that column can be found in the Id column 
of table y. For example, every cell in the "works in" column of table employee 
can be found in the Id column of table department. 

The point is that ologs can be drawn to represent a world-view (as in Section 
but they can also store data. Rules 1,2, and 3 in l2.1.1l align the construction of 
an olog with the ability to document instances for each of its types. 

3.2.2. Instance data as a set-valued functor. Let C be an olog. Section [3] so far has 
described instances of types, aspects, and facts and how all of these come together 
into a set of interconnected tables. The assignment of a set of instances to each 
type and a function to each aspect in C, such that the declared facts hold, is called 
an assignment of instance data for C. More precisely, instance data on C is a functor 
C Set, as in Definition 13.2.31 

Definition 3.2.3. Let C be a category (olog) with underlying graph \C\, and let 
Set denote the category of sets. An instance of C (or an assignment of instance 
data for C) is a functor I: C ^ Set. That is, it consists of 

• a set I{x) for each object (type) x in C, 

• a function /(/) : I{x) — )■ /(y) for each arrow (aspect) /: a; — ?■ y in C, and 

• for each fact (path-equivalence or equation) □ 

/l I /2 ! ■ ' ■ 1 fn — /l i /2 I ■ ' ' i frn 

declared in C, an equality of functions 

/(A) ; /(/2) ; • • • ; /(/„) = /(/() ; /(/^) ; • • • ; /(/;,). 
For more on this viewpoint of categories and functors, the reader can consult 

4. Communication between clogs 

The world is inherently heterogeneous. Different individuals in the world nat- 
urally have different world-views — each individual has its own perspective on the 



■^If we let / = /i ; /2 ;■••;/„ and /' = /{; then wc often write {/ = /') : i ^ j to 

denote the fact that these paths are equivalent. 

^By an individual we mean either an individual person acting on their own, a community 
acting as a single entity, a software agent, etc. Later in this section we will use the notion of a 
community acting as a distributed collection of linked, yet independent, individuals. 
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world. The conceptual knowledge (information resources) of an individual rep- 
resents its world-view, and is encoded in an ontology log, or olog, containing the 
concepts, relations, and observations that are important to that individual. An olog 
is a formal specification of an individual's world-view in a language representing 
the concepts and relationships used by that individual. In addition to the formula- 
tion of an expressive language, a specification needs to contain axioms (facts) that 
constrain the possible interpretations of that language. 

Since the ologs of different individuals are encoded in different languages, the 
important need to merge disparate ologs into a more general representation is dif- 
ficult, time-consuming and expensive. The solution is to develop appropriate com- 
munication between individuals to allow interoperability of their ologs. Commu- 
nication can occur between individuals when there is some commonality between 
their world-views. It is this commonality that allows one individual to benefit from 
the knowledge and experience of another. In this section we will discuss how to 
formulate these channels of communication, thereby describing a generalized and 
practical technique for merging ologs. 

The mathematical concept that makes it all work is that of a functor. A functor 
is a mapping from one category to another that preserves all the declared structure. 
Whereas in Definition 13 . 2 . 31 we defined a functor from an olog to Set, here we will 
be discussing functors from one olog to another. 

Suppose we have two ologs, C and 2?, that represent the world- views of two 
individuals. A functor F: C ^ V is basically a way of matching each type (box) of 
C to a type of V, and each aspect (arrow) in C to an aspect (or path of aspects) in T). 
Once ologs are aligned in this way, communication can occur: the two individuals 
know what each other is talking about. In fact, mathematically we can show that 
instance data held in C can be transformed (in coherent ways) to instance data held 
in P, and vice versa (see |Spi2| ). In simple terms, once individuals understand each 
other in a certain domain (be it social, mathematical, etc.), they can communicate 
their views about it. 

While the basic idea is not hard, the details can be a bit technical. This section 
is written in a more formal and logical style, and is decidedly more difhcult than 
the others. For this section only, we assume the reader is familiar with the notion 
of fibered categories, colimits in the category Cat of categories, etc. We return 
to our more informal style in Section [5l where we discuss how an individual can 
author a more expressive olog. 

4.1. Categories and their presentations. We never defined categories in this 
paper, but we defined ologs and said that the two notions amounted to the same 
thing. Thus, we implied that a category consists of the following: a set of objects, 
a set of arrows (each pointing from one object to another), and a congruence rela- 
tion on pathsQ This differs from the standard definition of categories (see [Mac]), 
which replaces our congruence relation with a composition rule and associativity 
law (obtained by taking the categorical quotient). One could say that an olog is a 
presentation of a category by generators (objects and arrows) and relations (path 
congruences). Any category can be resolved and presented in such a way, which we 



congruence relation on paths is an equivalence relation on paths that respects endpoints 
and is closed under composition from left and right (see the axioms in l20l l. 



OLOGS: A CATEGORICAL FRAMEWORK FOR KNOWLEDGE REPRESENTATION 19 



will call a specification. Likewise any functor can be resolved and presented as a 
morphism between specifications. 

In fact, this presentation form for categories (and the analogous one for functors) 
is preferable for our work on communication between ologs, because it separates 
the strictly graphical part of an olog (its types and aspects, regarded as the olog 
language) from the propositional part (its facts, regarded as the olog formalism). 
This presentation form is standard in the institutions [GB| and information flow 
[BSj communities, since it separates the mechanism of flow from the content of flow; 
in this case the formal content. Our work here applies the general theories of insti- 
tutions and information flow to the specific logical system that underlies categories 
and functors demonstrating how this logical system can be used for knowledge 
representation. Using the presentation forms for categories and functors, we show 
how communication between individuals is effected by the flow of information along 
channels. 

4.2. The architecture underlying information systems. We think of a com- 
munity of people, businesses, etc. in terms of the ologs of each individual partici- 
pant together with the information channels that connect them. These channels are 
functors between ologs, which allow communication to occur. The heterogeneity of 
multiple differing world-views connected through such links can lead to a flexibil- 
ity and robustness of interaction. For example, heterogeneity allows for multiple 
schemas to be employed in the design of database systems in particular, and mul- 
tiple languages to be employed in the design of knowledge representation systems 
in general. 

For any olog, consider the underlying graph of types and aspects. We regard this 
graph as being the language of the olog, □ with the facts of the olog being a subset 
of all the possible assertions that one can make within this language. Any two 
ologs with the same underlying graph of types and aspects have the same language, 
and since the facts of each olog are expressed in the same language, they can be 
"understood" by each other without translation. As such, wc think of the collection 
of all ologs with the same language (underlying graph) as forming a homogeneous 
context, with the ologs ordered in a specialization-generalization hierarchy. 

Whereas an olog represents (the world-view of) a single individual, an infor- 
mation system (of ologs) represents a community of separate, independent and 
distributed individuals. Here we consider an information system to be a diagram 
of ologs of some shape I; that is, a collection of ologs and constraints indexed by a 
base category I. The parts of the system represent either the ologs of the various 
individuals in the system or common grounds needed for communication between 
the individuals. Each part of the system specifies its world-view as facts expressed 
in terms of its language. The system is heterogeneous, since each part has a sep- 
arate language for the expression of its world-view. The morphisms between the 
parts are the alignment (constraint) links defining the common grounds. 

''We take an agnostic approach to foundations here. With the presentation form, we show 
how categories and functors are definable in terms of sets and functions, indicating how category 
theoretic concepts could be defined in terms of set theory. However, we fully understand that Set, 
the category of sets and functions, is but one example of a topos, indicating how set theoretic 
concepts could be defined in terms of category theory. 

^For the expert, this refers to the sketch logical system Sk, in its various manifestations. 

^Section 14.41 indicates how natural languages can be encoded into ologs. 
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As will be made clear in a moment, there is an miderlying distributed system 
consisting of the language (underlying graph) for each component part of the in- 
formation system and a translation (graph morphism) for each alignment link. We 
can think of this distributed system as an underlying system of languages linked by 
translating dictionaries. This distributed system determines an information chan- 
nel with core language (graph) and component translation links (graph morphisms) 
along which the specifications of each component part can flow to the core. We 
can think of this core as a universal language for the whole system and the channel 
as a translation mechanism from parts to whole. At the core, the direct flow of 
the component specifications are joined together (unioned) and allowed to interact 
through entailment. The result of this interaction can then be distributed back to 
the component parts, thereby allowing the separate parts of an information system 
to interoperate. 

In this section, we will make all this clear and rigorous. As mentioned above, 
we will work with category presentations (here called specifications) rather than 
categories. We will discuss the homogeneous contexts called fibers in detail and 
give the axioms of satisfaction. Wc will then discuss how morphisms between 
graphs (the translating dictionaries between the ologs) allow for direct and inverse 
information flow between these homogeneous fiber contexts. Finally, we discuss 
specifications (also known as theories) and the lattice of theories construction for 
ontologies. 

In Section 231 we will discuss how the information in ologs can be aligned by the 
use of common grounds. This alignment will result in the creation of information 
systems, which are systems of ologs connected together along functors. We will 
discuss how to take the information contained in each olog of a heterogeneous system 
and integrate it all into a single whole, called the fusion olog. Finally we will discuss 
how the consequence of bringing all this information together, and allowing it to 
interact, can be transferred back to each part of the system (individual olog) as a set 
of local facts entailed by remote ologs, allowing for a kind of interoperability between 
ologs. In Section 14.41 we will discuss conceptual graphs and their relationship to 
ologs. 

4.2.1. Fibers. A graph G contains types as nodes and aspects as edges. The graphs 
underlying an olog is considered its language. Any category C has an underlying 
graph \C\. In particular, |Set| is the graph underlying the category of sets and 
functions. Olog (12) has an underlying graph containing the three types '"person"', 
'"person-pair"' and '"woman^ and the three aspects 'has a parent', 'woman' and 
'has as mother'. Olog (17) has an underlying graph containing the three types 
'"employee"', '"department^, and '"string"' and the six aspects 'manager', 'works in', 
'secretary', 'name', 'first name' and 'last name'. Let eqn{G) denote the set of all 
facts (equations) that are possible to express using the types and aspects of G. A 
G-specification is a set E C eqn{G) consisting of some of the facts expressible in G. 
The singleton set with the one fact that "the female parent of a person is his/her 
mother" is a specification for the graph of Olog (12). The set with the two facts 
that "the manager has the same department as any employee" and "the secretary 
of a department is an employee in that department" is a specification for the graph 
of Olog (17). Let spec{G) denote the collection of all G-specifications ordered by 
inclusion Ei Q E2. 
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4.2.2. Satisfaction. It will be useful here to define an instance of a graph G, instead 
of an instance of a category C. An instance of a graph populates the graph by 
assigning instance data to it. An instance of a graph G is a graph morphism 
D . G I Set I mapping each type a; in G to a set D{x) of instances and mapping 
each aspect e: x ^> y in G to an instance function D{e): D{x) — ^ D{y). Using 
database terminology, we also call D a key diagram, since it gives the set of row 
identifiers (primary keys) of tables and the cell contents defined by key maps. 

A key diagram Z): G — > |Set| satisfies (is a model of) a G-fact e G eqn{G) 
(see Definition I3.2.3p . symbolized D \=q e, when we have an equality of functions 
^*(eo) = D*{€i). We also say that e (holds in) is true when interpreted in D. 
An identity (/ =g f)'- i j holds in all key diagrams (hence, is a tautology), 
and vice-versa for any set A e |Set| a constant key diagram A(A): G |Set| 
satisfies any fact e £ eqn{G). A key diagram D: G |Set| satisfies (is a model 
of) a G-specification E, symbolized D |=g E, when it satisfies every fact in the 
specification. For any graph G, a G-specification E entails a G-fact e, denoted by 
E \-a e, when any model of the specification satisfies the fact. The consequence E' 
of a G-specification E is the set of all entailed equations. The consequence operator 
(— )• is a closure operator, and the consequence of a specification is a congruence. 
For any G-specification E, entailment satisfies the following axioms. 

(basic) If E contains the equation e, then E entails e. 
(reflexive) E entails the equations (/ —g /) : i j for any path /: i ^ j. 
(symmetric) If E entails the equation (/i —q /2) ■ i — ^ j, then E entails the equation 
if 2 =G fl)-- i ^ 3- 

(transitive) If E entails the two equations (/i —q /2) : i ^ j and (/2 — g /a) : i — ^ J, 
(20) then E entails the equation (/i —q /a) : i ^ j- 

(compositional) If E entails the two equations (/i —q /2) ■ i ^ j and [gi —g 32) : J — > fc, 
then E entails the equation (/i ; gi —g /2 ; 32 ) ■ * — ^ fc- 
(bi-closed) If E entails the equation {gi —g 92) : J — ^ ^, then E entails the equations 
(/ ; 91 =G / ; 32) : i k and (gi \h =g 92 ; h) : j -> I for any left 
composable path /: i ^ j and any right composable path h: k ^ I. 

These are converted to inference rules in Table [H To construct E' , we first take the 
reflexive, symmetric, and transitive closure E* of E (so that E* is a G-specification 
and also the smallest equivalence relation containing E), and then we get E' by 
closing up under composition on left and right. We extend specification inclusion 
with the entailment order, where Ei <q E2 when Ei entails each equation in E2] 
that is, when El D E2 or equivalcntly when E{ 3 iSj. The statement "£'1 <g £'2" 
asserts that Ei is at least as specialized as i?2- The entailment order {spec{G), <g), 
which is a specialization-generalization order, represents a local version of the "lat- 
tice of theories" construction of Sowa |Sow2| (see Section 14.2.51). The opposite 
entailment order fbr{G) = {spec{G), >g) is called the fiber orderQ In the latticJ^ 
spec{G), the meet is union A = U and the join is intersection V = fl; whereas in 
the lattice fbr{G), the join is union V = U and the meet is intersection A = fl. 
Any specification E is entailment equivalent to its consequence E = E* . A specifi- 
cation E is closed when it is equal to its consequence E = E*. There is a one-one 
correspondence between closed G-specifications and categories over graph G. The 

^For consistency in discussion, wo follow the terminology of formal concept analysis |GW) . 
information flow IBSI and the theory of institutions IGBI . This includes the polarity induced by 
concept lattices and the directionality of infomorphisms. 

^'^This is a complete preorder, loosely called a "lattice". 
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conceptual intent of a key diagram D, implicit in satisfaction, is the closed specifi- 
cation int{D) consisting of all facts satisfied by the key diagram. Hence, D |=g E 
iS E C int{D) iff int{D) <g 

4.2.3. Elementary flow. A graph morphism : Gi — >■ G2 maps the types and as- 
pects of Gi to the types and aspects of G2. Graph morphisms are the transla- 
tions between ologs. A functor J-:Ci — > C2 has an imderlying graph morphism 
I J^l : \Ci\ — > IC2I. For any graph morphism H : Gi ^ G2, there is a fact function 
eqn{H): egn(Gi) eqn{G2) that maps a Gi-equation (/i =Gi fi)'- *i ^ ji to 
the G2-equation {H*{fi) =g^ H*{f[)) : H{ii) — > H{ji), and a key diagram functor 
dgm{H): dgm{G2) — > dgm{Gi) that maps a key diagram D2: G2 ^ |Set| to the 
key diagram H 0D2: G2 ^ |Set|I!l The fact function is the fundamental unit of 
information (formal) flow for ologs, and the key diagram functor is the fundamental 
unit of semantic flow for ologso Formal flow is adjoint to semantic flow — satis- 
faction is invariant under flow: dgm{H){D2) \=Gi ci 12 ^2 \=G2 G9n(i7)(ei) for 
any graph morphism H: Gi G2, source fact ei and target diagram D2. Spec- 
ifications can be moved along graph morphisms by extending the fact (equation) 
function. For any graph morphism H: Gi ^ G2, define the direct flow opera- 
tor dir{H) = peqn(H) : spec(Gi) — s- specfG?]^ and the inverse flow operator 
inv{H) = eqn{H)~^ {{-)') : spec{G2) spec(Gi). Direct and inverse flow are 
adjoint monotonic functions {dir{H) H inv{H)) : fbr{Gi) — > fbr{G2) w.r.t. fiber 
order: dir{H){Ei) >g2 E2 Ei >Gi inv{H){E2)- For any graph morphism 
i/: Gi — >■ G2, any Gi-specification Ei^ and any G2-specification E2, entailment 
satisfies the following axioms. 

(direct flow) If E-^ entails the equation (/ — Gi /') ■ ^ ^ then dir{H){Ei) entails 
the equation =02 H'(f[)): H{h) -> H{ji). 

(inverse flow) If E2 entails the equation {H* (f) —G2 (/'))■ -^(0 ~^ ^U)^ then 
inv{H){E2) entails the equation (/ — d /')■ ~^ j- 

These are converted to inference rules in Table [TJ A graph morphism H : Gi — !■ G2 
defines a consequence operator {-)*" = dir{H) o inv{H) on the fiber preorder 
/br(Gi), where E^ >g, E^ >g, E*" . 

4.2.4. Specifications. A specification S = {G,E) is an indexed notion consisting of 
a graph G and a G-specification E G spec{G). It is sometimes convenient to use 
the symbol '5' in place of 'E^; for example, to say that "5 G spec{G)". A category 
C can be resolved and presented as a specification spec{C) = (G, E) consisting of 
the underlying graph G = |C| containing the types and aspects of C and the col- 
lection E of all facts that hold in C In the other direction, any specification S 
induces a (quotient) category cat{S). Olog (12) and Olog (17) are described as 
specifications in Section [4.2.11 A specification morphism H : {Gi,Ei) — > (G2,i?2) 
is a graph morphism H : Gi G2 that preserves entailment: Ei f-gj ei implies 
E2 l~G2 Gqn{H){ei) for any ei G eqn{Gi); or equivalcntly that satisfies the ad- 
jointness conditions, dir{H){Ei) >g2 ^2 iff i?i >Gi inv{H){E2). Being a graph 
morphism, it maps types to types and aspects to aspects. Moreover, it also maps 
facts in Ei to facts in E2; that is, it preserves all the declared structure. A 
functor J-: Ci — > C2 can be resolved and presented as a specification morphism 

^^This is the first step in the algebraization of Tarski's "semantic definition of truth" | Ken4| . 
^^The composition of graph morphisms is written in diagrammatic order. 
^•^This is so, at the abstraction of institutions |Ken3) . 
^^The symbol p denotes the power-set operator. 
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T: spec{Ci) spec{C2)- Hence, the presentation form for a functor does exactly 
what the functor does. The fibered category of specifications Spec has specifica- 
tions as objects and specification morphisms as morphisms. Thus, it is defined in 
terms of information flow. There is an underlying graph functor gph : Spec — >■ Gph 
from specifications to graphs (G, E) n- G. The subcategory over any fixed graph 
G is the fiber fbr{G); because of the opposite orientation, we say that "the cat- 
egory of specifications points downward in the concept lattice". Throughout this 
section we identify ologs with specifications and olog morphisms with specification 
morphisms. 

4.2.5. The lattice of theories construction. Sowa's "lattice of theories" construction 
(LOT) describes a modular framework for ontologies [Sow2| . The Olog formalism 
follows the approach to LOT described in |IFF2|Pl In the Olog formalism, LOT is 



locally represented by the entailment preorders spec{G), and globally represented 
by the category of specifications Spec. We follow the discussion in section 6.5 
"Theories, Models and the World" of Sowa [Sow2| . From each olog (specification) 
in the "lattice of theories", the entailment ordering defines paths to the more gen- 
eralized ologs above and the more specialized ologs below. Sowa defines four ways 
for moving along paths from one olog to another: contraction, expansion, revision 
and analogy. 

Contraction: Any olog can be contracted or reduced to a smaller, simpler 
olog, moving upward in the preorder spec{G), by deleting one or more 
facts. 

Expansion: Any olog can be expanded, moving downward in the preorder 
spec{G), by adding one or more facts. 

Revision: A revision step is composite, moving crosswise in the preorder 
spec[G)\ it uses a contraction step to discard irrelevant details, followed by 
an expansion step to added new facts. 

Analogy: Unlike contraction and expansion, which move to nearby ologs in 
an entailment preorder spec{G), analogy moves to an olog in a remote 
entailment preorder in the category Spec via the flow along an underlying 
graph morphism H : Gi ^ G2 by systematically renaming the types and 
aspects that appear in the facts: any olog Ei in spec{Gi) is moved (by 
systematic renaming) to the olog dir{H){Ei) in spec{G2)- 
According to Sowa, the various methods used in nonmonotonic logic and the oper- 
ators for belief revision correspond to movement through the lattice of theories. 

4.3. Alignment and integration of information systems. 

4.3.1. Common ground. Given the world- views of two individuals, as represented 
by ologs Si = {Gi,Ei) and 1S2 = (G2, i?2), there is httle hope that one of them com- 
pletely contains the other (even after allowing for renaming of types and aspects) , 
and there is correspondingly little chance of finding a meaningful olog morphism be- 
tween the two. Instead, in order to communicate the two individuals could attempt 
to find a common ground, a third olog S = (G, E) and meaningful morphisma^n 



^'^The IFF term 'theory' is replaced by the Olog term 'specification' or 'olog'. 

^^Roughly speaking, an olog morphism _F: C — > is meaningful when for each type X in C, 
every intended instance of X in C would be considered an instance of F{X) by the author of V 
(in which case we say the intention for types is respected), and in a similar way the intention for 
aspects is respected. Precisely speaking, if I : C — >■ Set and J : T) Set are instance data for 
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Hi: S ^ Si and H2 ■ S ^2 This connection is a 1-dimcnsional knowledge 
network Si 4— ^ S -—^ S2 of shape ••<—•—)•• called a span (in Spec), where each 
node is an clog and each edge is a morphism between ologs. The requirements of 
this span are that dir{Hi){E) >Gi Ei and dir{H2){E) >g2 E2, two requirements 
involving local flow. Equivalently, that E >g inv{Hi){Ei) Vg inv{H2){E2)- The 
latter precise expression can be rendered in natural language as "the world-view of 
the common ground is contained in the combined world- views of the two individu- 
als". The various local direct /inverse flows allow world- views to be compared. Such 
a common ground can be expanded and improved over time. The basic idea is that 
one individual can attempt to explain a new idea (type, aspect or fact) to another 
in terms of the common ground. Then the other individual can either interpret this 
idea as they already have, learn from it (i.e. freely add it to their olog), or reject 
it. We view an olog morphism Hi : Si ^ S2 as an atomic constraint (alignment) 

link between iSi and ^2 We view a common ground span Si < — - S — ^ ^2 
as a molecular constraint between Si and ^2, which is weakest when 5 = and 
strongest when iSi ~ S ^ S2- 

4.3.2. Systems of ologs. In the general case, more than two individuals will share 
a common ground. For example, companies that do business together may have a 
common-ground olog as part of a legal contract; or, the various participants at a 
conference will have some common understanding of the topic of that conference. 
In fact, for any finite set of ologs X = {iSi, S2, ■ ■ ■ ,Sn}, there should be a common 
ground world-view (even if empty), say Sx- If Y C X is a subset, then there should 
be a map Sx — >■ because any common understanding held by the individuals in 
X is held by the individuals in Y. For example, the triangular-shaped diagram 

(21) Si 




S2 ^ ^23 ^ ^3 



represents three individuals {1,2,3}, their ologs {iSi, 1S2, 53}, their pair-wise com- 
monality ologs {i5i2, 5i3, ^23}, and their three-way commonality olog iSi23. This 
diagram, which stands for the interaction between individuals {1,2,3}, does not 
stand alone, but is part of an intricate web of other ologs and alignment constraints. 
In particular, individuals 1 and 3 may be part of some different interacting group, 
say of individuals {1,3,6,7}, and hence the right edge of the diagram would be 
part of some tetrahedron-shaped diagram with vertices {1,3,6,7}. If we take the 
point-of-view that "a collection of ologs representing the world- views of various in- 
dividuals" is a system, then we can think of the ologs as being the types of that 

C and "D, then F is meaningful relative to / and J if one can exhibit a natural transformation 
fj. : / => F o J as in |Spi2| . 

^^A common ground olog is also called a reference ontology in knowledge representation. 

^^This is so, at the abstraction of institutions IKen3| . 
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system, the morphisnis connecting the ologs as being the aspects of that system, 
with the shape of a system being its underlying graph. In essence, we can apply 
ologs to themselves. In the system represented by diagram (|2ip . there are seven 
types {Si,S2,S3,Si2,Si3,S23,Si23} and nine aspects {• • ■ ,^123 ^13, . . . }, and 
the shape looks like this 




In addition, we can introduce certain facts to represent the meaning of that system 
and then enforce those facts. 

A distributed system is a diagram (functor) Q : I ^ Gph of shape I within the 
ambient category Gph. As such, it consists of an indexed family {G„ | n G 1} of 
graphs together with an indexed family {Ge ■ Gn Gm \ (e : n ^ m) S 1} of graph 
morphisnis. Let Dist(I) denote the collection of distributed systems of shape I. 
An information system is a diagram 5 : I Spec of shape I within the ambient 
category Spec. As such, it consists of an indexed family {Sn = (G„, £"„) | n e 1} of 
ologs together with an indexed family {Se '■ Sn — > Sm | (e : n — >■ m) G 1} of olog mor- 
phisms. Some of these ologs might represent the world- views of various individuals, 
whereas others could be common grounds; also included might be portals between 
individual ologs and common grounds, as in the CG example of Section 14.41 Let 
Info(I) denote the collection of information systems of shape I. An information sys- 
tem S with component ologs iS„ — (Gn, £„) has an underlying distributed system Q 
of the same shape with component graphs G„ for n e I. For any distributed system 
Q, let infoi{Q) denote the collection of information systems over Q of shape I. There 
is a pointwise entailment order S <g S' on infoi{Q) when component ologs satisfy 
the same entailment ordering £"„ <g„ for n G I, and by taking the coproduct 
there is a pointwise entailment order on Info(I) = lJgeDist{i) ^'^/''i(^)- ^ constant 
distributed system A(G) G Dist(I) is a distributed system A(G): I —5- Gph with 
the same language G for any index n £ 1. Any constant distributed system defines 
join and meet monotonic functions Vg' Ag • ~^ fl^iG) mapping an 

information system S G m/oi(A(G)) to the join and meet ologs \/ S = Unei 
and /\S = Hriei in fbr{G). The join monotonic function is adjoint to the con- 
stant monotonic function : fbr{G) ^> m/oj(A(G)) that distributes an olog 
S' G fbr{G) to the various locations n G I forming a constant information sys- 
tem A{S') G m/oi(A(G)), since \/ S >g S' iS S >]^^g) ^('^') system 
S G infoi{A{G)) and any olog S' G fbr{G). 

4.3.3. System morphisms. Just as ologs are linked by morphisms, information sys- 
tems are also linked by morphisms. For these there is the new complication of shape. 
In this paper we define fixed-shape system moorphisms, but a more general defi- 
nition would allow the shape to vary. A distributed system morphism 9: Q ^ Q' 
in Dist(I) consists of a collection {On'- Gn — > G^ | n G 1} of component graph 
morphisms, which are systematically coordinated in the sense that they satisfy the 
naturality conditions GeoOm — 9n°G'^ for any indexing link e : n ^ m in I. A direct 
flow operator diri{6) : infoi{Q) — > info-^iQ') along can be define, which maps an 
information system S G infoi{Q) to an information system diri{6){S) G info^{Q') 
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defined by diri{e){S)n = dir{0n)iEn) for 71 e 10 An inverse flow operator 
invi{6) : infoi{Q') infoi{Q) can similarly be defined. Direct and inverse 
flow arc adjoint monotonic functions {diri{9) H invi{9)) : infoj{Q) infoi{Q'), 
since diri{d){S) >g, S' iff S >g i'nvi{6){S'). An information system morphism 
: 5 iS' in Info(I) consists of a collection {0„ : Sn iS'i | n G 1} of component 
olog morphisms, which are systematically coordinated and preserve alignment in 
the sense that they satisfy the naturality conditions Se o 9^ = On ° S'^ for any in- 
dexing link e : 71 — !■ m in I; equivalently, is a morphism between the underlying 
distributed systems 9: Q =4> Q' and the direct flow of S is at least as general as 
S': diri{9){S) >g, S' . The ordering S >g S' is an information system morphism 
9: S ^ S' with identity component translations 9„ = ido^ for each index ri G I. 

4.3.4. Channels. We continue with our systems point-of-view. Since we have rep- 
resented the whole system as a diagram S of parts (ologs) Sn with part-part re- 
lations (alignment constraints) 5„ Sm, we also want to represent the whole 
system as an olog C with part-whole relations iS„ — >■ clf^ An information chan- 
nel (7: A(C),C} consists of an indexed family {7„: G„ — > C | n G 1} of 
graph morphisms called flow links with a common target graph C called the core 
of the channel. A channel (7, C) covers a distributed system Q of shape I when the 
part-whole relationships respect the alignment constraints (arc consistent with the 
part-part relationships): 7„ = Ge o 7™ for each indexing morphism e: tt, — >■ to in 
I. A covering channel is a distributed system morphism 7: ^ ^ ^(C*) in Dist(I) 
from distributed system Q to constant distributed system A(G): I Gph. Such 
a channel defines a direct flow operator diri{'-^) : infoi{Q) m/oj(A(G)) and 
an inverse flow operator mt>i(7) : m/oj(A(G)) — > infoi{Q). For any two cover- 
ing channels (7', C) and (7, C) over the same distributed system a reflnement 
H : (7', C") — > (7, G) is a graph morphism between cores H : C' ^ C that respects 
the part- whole relationships of the two channels: ^'^o H — 7„ for n G I. In such 
a situation, we say the channel (7^ C') is a refinement of the channel (7, G). A 
channel (i, WQ) is a minimal coveio or optimally refined covering) channel of a 
distributed system Q when it covers Q and for any other covering channel (7, G) 
there is a unique reflnement [7, G] : U S — > G from (i, ]J Q) to (7, G). 

4.3.5. System flow. In order to represent an information system S = {Sn — ^ Sm} 
as a single olog JJ S, called the fusion of S, with part-whole relations iS„ ^ ]J 5, we 
follow the colimit theorem of |TBG| by recognizing the following three properties. 

• Optimal channels exist for any distributed system Q. 

• fbr{G) is a complete preorder for any graph G, loosely called a "lattice". 

• For any graph morphism H : Gi — >■ G2, direct and inverse flow are adjoint 
monotonic functions {dir{H), inv{H)) : fbr{Gi) fbr{G2). 

Let Q G Dist(I) be a distributed system of shape I with optimal channel (i, U^). 
The optimal core = is called the sum of the distributed system Q, and 
the optimal channel components (graph morphisms) {t„ : G„ — > ]J | n G 1} are 
called flow links. There is a direct system flow monotonic function (sec Figure [T|) 

^^Well-defined, since dir(G^)((iir(6»„)(£;„)) = dir{em)<,dir{G^){En)) >m dir(era)(Em). 
^"The theory of part-whole relations is called mereology. It studies how parts arc related to 
wholes, and how parts are related to other parts within a whole. 
'^-'^Information flow terminology IBSI . 
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S diri{i) 



info^iG) H m/oi(A(e)) H fbr{g) 9 U-S 

" J 



5* invi(i) 



Figure 1. System Flow 

dir (i^g-j ~ diri{L) ■ : infoi{Q) fbr{Q). Direct system flow has two steps: (i) 
direct (fixed shape) system flow of an information system along the optimal channel 
(Dist(I)-morphism) l: Q => A{Q) and (ii) lattice join combining the contributions 
of the parts into a whole. In the opposite direction, there is an inverse system flow 
monotonic function (see Figured]) inv(^ig-^ = • invi{L): fbr[Q) — > infoj{Q). 

Inverse system flow has two steps: (i) mapping an olog with core language Q to 
a constant information system over A (5) with shape I by distributing the olog 
to the locations n G I, and (ii) inverse (fixed shape) system flow of this constant 
information system back along the optimal channel l: Q ^ ^{Q)- Direct system 
flow is adjoint to inverse system flow {dirijg^ H invi^ig-^): infoi{Q) — > fbr{Q), 
since the composition components are adjoint. For any distributed system Q G 
Dist(I) with optimal core = any information system S G infoi{Q), and 

any olog S G fbr{Q), entailment satisfies the following axioms. 

(direct flow) If En entails the equation (/ —Gn f ) '• ^ j j then rfir^i entails 
the equation —g t-^if'))' i^ni^) ~^ f^nU) for n 1. 

(inverse flow) If S entails the equation (t*(/) — ^ '^n(f'))'- f-nij), then 

inv {S)n entails the equation (/ —Gn /') ■ ^ ~^ J foi' ^^ly n G I. 

These are converted to inference rules in Table [TJ Information fiow can be used to 
compute the fusion olog for an information system and to define the consequence 
of an information system. Fusion is direct system flow, and consequence is the 
composition of direct and inverse system flow. Let S G infoi{Q) be any information 
system. The fusion jJS = dir(j,g){S) = {]1Q,\I n^idir{Ln){ En)) € fbr{Q) is an 
olog that represents the whole system in a centralized fashion [Ken2| , [Ken3| . The 
consequence 5^* g,^ = inv ij^g){dir ij_^g)[S)) = inv tj^g){\[S) = {inv{Ln){WS) \ n G 
1} G infoi{Q) is an information system that represents the whole system in a 
distributed fashion |Ken3j . It is inverse flow of the fusion olog along the optimal 
channel, transfering the entailed facts of the whole system to the component parts. 

The consequence operator (-)*, which is defined on information systems, is a 
closure operator on the complete preorder m/oj(CJ), and by taking the coproduct 
it is a closure operator on the complete preorder Info(I) = IJeGDist(i) ^'^/''i(^) 
: (increasing) S > S* , (monotonic) S > S' implies S* > S'* and (idempotent) 
S** = S* .]l3 Pointwise entailment order < on Info(I) is only a preliminary order, 
since it does not incorporate interactions between system component parts. System 



^^By allowing system shape to vary, channels can be generalized to morphisms of distributed 
systems. Then a notion of relative fusion (direct system flow) can be defined in terms of left Kan 
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(/ =G /) : i 3 
ifl =G f2)-- i ^ j 
(/2 =G /l) : * -> j 

ifl =G /2) : i j: (/2 =G h): i j 

ifl =G fa): i ^ j 
ifl =G /2) : i ^ j. {gi =G g-i)- j ^ k 
(/l ; 91 =G /2 ; 92) : i fe 
(91 =G 92) : j -> fe 
(/ ; 91 =G / ; 92) : i fe, (91 ; =G 92 : /l) : i -> i 

ifl =Gi fi)- '1 

=G2 -f^*(/0): -ffC^l) -f^bl) 

=G2 -ffl^l) HU,) 
ifl =Gi : il jl 

(/=G„ /'): 
«(/) =s <(/')): ^'..0') 

(/ =G„ /'): « ^ J 

Table 1. Inference Rules 

entailment order < on Info (I) is defined by Si ^ ^2 when S* < S*; equivalently, 
S* < 52- Pointwise order is stronger than system entailment order: Si < S2 implies 
Si ^ ^2. This is a specialization-generalization order. Any information system S 
is entailment equivalent to its consequence S = S* . An information system S is 
closed when it is equal to its consequence S = S* . 

The whole effect of taking the system consequence may be greater than the 
sum of its parts, in the sense that Sn >n S*'" >n \l rn'''''^'^i'^n){d.ir{Lm){Sm)) 
S* for any n G I, since separate parts may have a productive interaction at the 
channel core. A final part of an information system is a part with no non-trivial 
constraint links from it. (The graphical subsystem beneath) nonfinal parts are 
necessary for the alignment of information systems, resulting in the equivalencing of 
types and aspects through quotienting. However, because of the covering condition 
t„ = Ge o and the entailment order dir{Ge){En) >m Em for constraint links 
iSg : Sn — >■ Sm, only the fact(ual) content of final parts of information systems are 
necessary to compute the system fusion and consequence. 

4.3.6. General examples. Here are some examples of system fusion/consequence. 

extension, and a notion of relative system consequence can be defined as the composition of direct 
followed by inverse system flow. 



equivalence: (reflexive) 



(symmetric) 



(transitive) 



algebra: (compositional) 



(bi-closed) 



morphic flow: (direct) 



(inverse) 



system flow: (direct) 



(inverse) 
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• An information system S with a constant underlying distributed system, 
Gi = G for all n El, gathers together all the component parts of the infor- 
mation system and forms their consequence. It has identity flow links {6„ = 
ido- G G — Y[Q I^Sl}, component join fusion ]J5 = Vnei'^n = 
(G, i?„}, and constant system consequence S* ~ (Vn'ei'^n')* ^'-'^ 

n el. 

• A discrete information system S = {5„ = {G„,E„) | n G 1} with no 
constraint links Gp '■ Sn <Sm for n m, has coproduct injection flow links 
L„ '■ Gn — +nei non-rcstricting fusion, and inverse flow projecting back 
to individual component consequence S* = 5,* for all n g I. No alignment 
(constraint) links means no interaction. 

• An information system S = {Si <— S S2} consisting of a single com- 
mon ground S = {G,E) between two component ologs Si = {Gi,Ei) and 

S2 ~ (G2,-E'2), with underlying distributed system (span) Q = {G\ 

G G2}, has pushout injection flow links Gi YlQ <^ G2, direct 
image union fusion ]JS = {]JG, d,ir{ti){Ei) Li dir{L2){E2)), and system 
consequence components S* = (G„, mt;(t„)(£/ir(ti)(£'i) U <iir(t2)(i?2))) 
for n = 1,2. The flow links will quotient any types and aspects that are 
connected through the common ground allowing for the approprate interac- 
tion in the fusion consequence {dir{Li){Ei)\Jdir{i,2){E2))' , then the inverse 
flow will reconnect this with the component types and aspects. 

4.4. Conceptual graphs. The conceptual graph formalism (CG) for knowledge 
representation |Sow2| , was initially formulated to represent database systems (DBS), 
but is now used in natural language processing (NLP) and first-order logic (FOL). 
Verbs in NLP can often be represented relationally by star(-shaped conceptual) 
graphs. For example, the sentence "John is going to Boston by bus" might be 
represented by the conceptual graph 



(22) I Person: JohrTI ^ (go) 



City; Boston 



I Bus I 

In a sentence of natural language, thematic roles are semantic descriptions of the 
way (the entities described by) a noun phrase functions with respect to (the action 
of) the verb. These entities are the participants in the occurrent expressed by the 
verb. For the action of 'going' in the above sentence there are three participants 
and hence three thematic roles. 'John' plays the role of the agent of the action, 
a 'Bus' is the instrument used in the action and 'Boston' is the destination of the 
action. Translations using thematic roles can be used to align two ontologies with 
respect to a common ground. A CG-style translation of conceptual graph (|22p 
would replace the verb relation 'going' with a concept 'Go' and replace the edges 
that form the signature of the 'going' relation with binary relations for the three 
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roles 'agent', 'instrument' and 'destination'. 

I Person: John | < ( ^agcnt^ 1 Go | - 



(23) 



City: Boston 



However, the case relations that semantically describe the thematic roles should be 
viewed as functional in nature; that is, for any instance of the action of a sentence's 
verb there is a unique entity described by a noun phrase of the sentence. When 
this semantics is respected, the translation to thematic roles becomes a process 
of "linearization", which is best described abstractly as: (1) the identification of 
relation types with entity types, (2) the translation of a sorted multiarity relation to 
a span of functions, one function for each role, and (3) the functional interpretation 
of thematic roles. 

The Olog formalism, which also represents DBS and NLP, is a version of equa- 
tional logic. Both the Olog and CG formalisms were designed as graphical rep- 
resentations. However, the CG formalism is binary and relational, whereas the 
Olog formalism is unary and functional. The CG formalism is binary since it has 
two kinds of type, concepts and relations; it is relational in the way it interprets 
edges. The Olog formalism is unary since it has only one kind of type, the abstract 
concept; it is functional in the way it interprets aspects (edges). However, much 
of the semantics of the CG fagnalism can be transformed to the Olog formalism 
by the process of linearizatioro, thereby gaining in efficiency and conciseness. For 
example, conceptual graph ([^ can be linearized to the olog graplo 



(24) 



0- 



T John 


Person 


agent ^ 


J ' 





j deat 


City 


Boston |- 







I Bus I 

Since olog aspects are interpreted functionally, the functional nature of thematic 
roles is respected. In this manner, the olog formalism could be used to replace 
the CG representation of ontologies. For example, a community (acting as an 
individual) could build its ontology C from ground up by aligning it with some top- 
level reference ontology T (such as in the appendix of |Sow2| ) . thereby importing 
some formal semantics from T. The following fragment demonstrates how this 
works. 

Assume that ontology T contains the concept of "spatial process" as represented 

by the general concept type '~Spatial-Process~' with aspects '~Spatial-Process~' 

'"Agent"', ^Spatial-Process"' -^^^ '"Vehicle"' and '"Spatial-Process"' '"Location"'. 
At some stage assume that the community ontology C has specified the concept 
type orderings '"Person"' < '"Agent"', ^Bus"' < ^Vehicle"' and '"City"' < '"Location"' 

with corresponding injective aspects '"Person"' ^Agent"', '"Bus"' ^ ^Vehicle"' and 

^^The linearization process works for any binary/relational knowledge representation, such as 
CGs, entity-relationship data modelling |JRW] . relational database systems |Ken5) or the Infor- 
mation Flow Framework IIFFII . In the entity-relationship data modelling, n-ary relationship links 
are replaced by n-ary spans of aspects and attributes are included as types, 
is the universal type to which all types have a unique aspect. 
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'"City"' — > '"Location"'. At the next stage it could define a concept type '"C"' with as- 
pects ^C^ P"'""") ^Person^, ^C^ ^ ^Bus^ and ^C^ ^ ^City^, and link it with 
the reference ontology concept '"Spatial-Process"' by specifying a connecting aspect 
'"C^ '^^°'^'^^^) '"Spatial-Process"' and asserting the facts 'person ; is = process ; agent', 
'bus ; is = process ; vehicle' and 'city ; is = process ; locatioii'El In the more ex- 
pressive ologs with joins (Sectional), the process concept of "going to city by bus" 
can then be defined as the puUback of the "spatial process" concept: here, the 

concept type '"Go"' with aspects ^Go"' ''°'^^°"> '"Person"', ^Go"' '"Bus"' and 

'"Go"' — ^ '"City"' is pulled back along the above injective aspects, resulting in the 

injective aspect '"Go"' '"Spatial-Process"' with corresponding concept type order- 
ing ^Go"' < ^Spatial-Process"'. As a result, the concept ^C"' has the new mediating 

aspect C ^°'"^> Go, which satisfies the fact 'going ; is = process'. In this manner 
the community ontology C has been enlarged. 




V 

We assume that community ontology C and reference ontology T are combined into 

P A 

a portal ontology V with portal link C — > V and alignment link T — > P. If some 
other ontology C is built up and aligned in the same fashion, then T is being used 
as a common ground, and we have a 'W'-shaped information system 

(25) C T C 




with portals P and P' being the final parts. This 'W'-shaped information system 
uses the sketch institution Sk for ologs. It can be compared to the 'W'-shaped 
information system in |Kenlj . which uses the information flow IF institution for 
(local) logics. 

5. More expressive ologs I 

In this section and the next ([5] and [5]) we will introduce limits and colimits 
within the context of ologs. These will allow authors to build ologs that are quite 



"^^The symbol ';' denotes concatenation or formal composition. 
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expressive. For example we can declare one type to be the union or intersection 
of other types. We do not assume mathematical knowledge beyond that of sets 
and functions, which were loosely defined in Section [2.21 However, the reader may 
benefit by consulting a reference on category theory, such as [ Awoj . 

The basic ologs discussed in previous sections are based on the mathematical 
notion of categories, whereas the olog presentation language we will discuss in this 
section and the next are based on general sketches (see jMak] ) . The difference is 
in what can be expressed: in basic ologs we can declare types, aspects, and facts, 
whereas in general ologs we can express ideas like products and sums, as we will 
see below. 

We will begin by discussing layouts, which will be represented categorically by 
"finite limits". As usual, the english terminology (layout) is not precise enough to 
express the notion we mean it to express (limit). Intuitively, a limit can be thought 
of as a system: it is a collection of units, each of a specific type, such that these 
units have compatible aspects. These will include types like ^a man and a woman 
with the same last name"'. In Section [S] we will discuss groupings, which will be 
represented by colimits. These will include types like '"a thing that is either a pear 
or a watermelon"'. 

5.1. Layouts. A dictionary might define the word layout as something like "a 
structured arrangement of items within certain limits; a plan for such arrangement." 
In other words, we can lay out or specify the need for a set of parts, each of a 
given type, such that the parts fit together well. This idea roughly corresponds 
to the notion of limits in category theory, especially limits in the category of sets. 
Given a diagram of sets and functions, its limit is the set of ways to accordingly 
choose one element from each. For example, we could have a type '"a car and 
a driver"', which category-theoretically is a product, but which we are calling a 
"layout" — a compound type whose parts are "laid out." Of course, the term layout 
is insufficient to express the precise meaning of limits, but it will have to do for 
now. To understand limits, one really only need understand puUbacks and products. 
These will be the subjects of Sections 15.21 and 15.31 or one can see jAwoj for more 
details. 

5.2. Fullbacks. Given three objects and two arrows arranged as to the left, the 
puUback is the commutative square to the right: 



C 

Given: 



the puUback is drawn: 




We write A = B XdC and say "A is the puUback of B and C over D." The question 
is, what does it signify? We will begin with some examples and then give a precise 
definition. 

Example 5.2.1. We will now give four examples to motivate the definition of pull- 
back. In the first example, (|26|) . both B and C will be subtypes of D, and in such 
cases the puUback will be their intersection. In the next two examples (P71 and 
only B will be a subtype of D, and in such cases the puUback will be the 
"corresponding subtype of C" (as should make sense upon inspection). In the last 
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example ([2S1), neither B nor C will be a subtype of D. In each line below, the 
pullback of the diagram to the left is the diagram to the right. The reader should 
think of the left-hand olog as a kind of problem to which the new box A in the 
right-hand olog is a solution. 
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(26) 



a loyal 
customer 



a wealthy 
customer 



D 



a customer 



(27) 



blue 



B 



a person 



has as 

favorite color 



a color 



(28) 



a woman 



a dog 



tias as owner 



D 



a person 



(29) 



a piece of 
furniture 



has 



a space ni 
our house 



has 



a width 



A=BxdC 

a customer 
that is wealthy 
and loyal 



a loyal 
customer 



a wealthy 
customer 



D 



a customer 



A=BxdC 



a person 
whose favorite 
color is blue 



favorite eolor 



blue 



a person 



favorite color 



D 



a color 



A=BxdC 



a dog whose 
owner is a 

woman 



has as owner 



a woman 



a dog 



has as owner 



D 



a person 



A=BxdC 



a pair (/, s) 
where / is a piece 
of furniture and s 
is a space in our 
house, and where 
/ and s have the 
same width 



a piece of 
furniture 



a space m 
our house 



has 



D 



a width 
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See Example 15 . 2 . 31 for a justification of these, in light of Definition 15.2.21 

The following is the definition of puUbacks in the category of sets. For an olog, 
the instance data are given by sets (at least in this paper, see Section [3]), so this 
definition suffices for now. See |Awoj for more details on puUbacks. 

Definition 5.2.2. Let B, C, and D be sets, and let /: _B — > _D and g: C ^ D he 

functions. The pullhack oi B D -(^ C, denoted B Xjj C, is defined to be the set 

BxdC -.^ {(6, c) I 6 e B, c e C, and f{b) = g{c)} 

together with the obvious maps B x £, C ^ B and B Xjj C ^ C, which send an 

element (6, c) to b and to c, respectively. In other words, the puUback oi B ^ D ^ 
C is a commutative square 

BxdC 

g 



Example 5.2.3. In Example 15.2.11 we gave four examples of puUbacks. For each, 

we will consider B ^ D -(^ C to he sets and functions as in Definition 15.2.21 and 
explain how the set A follows that definition, i.e. how its label fits with the set 
BxdC = {{b, c)\beB,ceC, and f{b) = g{c)}. 

In the case of ([25]), the set B x^C should consist of pairs (w, where ui is a 
wealthy customer, / is a loyal customer, and w is equal to I (as customers). But if 
w and I arc the same customer then (w, I) is just a customer that is both wealthy 
and loyal, not two different customers. In other words, an instance of the puUback 
is a customer that is both loyal and wealthy, so the label of A fits. 

In the case of (P7)) . the set B Xq C should consist of pairs (p, b) where p is a 
person, b is the color blue, and the favorite color of p is equal to b (as colors). In 
other words, it is a person whose favorite color is blue, so the label of A fits. If 
desired, one could instead label A with '"a pair (p, b) where p is a person, b is blue, 
and the favorite color of p is 6"'. 

In the case of (|28p . the set B x^ C should consist of pairs {d,w) where d is a 
dog, w is a woman, and the owner of d is equal to w (as people). In other words, 
it is a dog whose owner is a woman, so the label of A fits. If desired, one could 
instead label A with '"a pair (d, w) where d is a dog, w is a woman, and the owner 
of d is w^. 

In the case of ([2^ . the set S x ^ C should consist of pairs (/, s) where / is a 
piece of furniture, s is a space in our house, and the width of / is equal to the width 
of s. This is fits perfectly with the label of A. 

5.2.4. Using puUbacks to classify. To distinguish between two things, one must 
find a common aspect of the two things for which they have differing results. For 
example, a pen is different from a pencil in that they both use some material to 
write (a common aspect), but the two materials they use are different. Thus the 
material which a writing implement uses is an aspect of writing implements, and 
this aspect serves to segregate or classify them. We can think of three such writing- 
materials: graphite, ink, and pigment-wax. For each, we will make a layout in the 
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olog below: 



Ai=BxoCi 

a writing imple- 
ment that uses 
graphite 



graphite 



/A2=Bx 0C2 



a writing imple- 
ent that uses 
ijik 




<C3 



;ing imple- 
that uses 
pitm(|nt-wax 



pigment- 



a writing implement 



a writing material 



One could also replace the label of box Ai with "a pencil", the label of box A2 
with "a pen", and the label of box As with "a crayon"; in so doing, the layouts at 
the top would define a pencil, a pen, and a crayon to be a writing implement that 
uses respectively graphite, ink, and pigment-wax. 

5.2.5. Building pullbacks on pullbacks. There is a theorem in category theory which 
states the following. Suppose given two commutative squares 

1 ^3 *-5 



2 ^4 ^6 

such that the right-hand square (3,4,5,6) is a puUback. It follows that if the left- 
hand square (1,2,3,4) is a puUback then so is the big rectangle (1,2,5,6). It also 
follows that if the big rectangle (1,2,5,6) is a puUback then so is the left-hand square 
(1,2,3,4). This fact can be useful in authoring ologs. 

For example, the type ^a cellphone that has a bad battery"' is vague, but we can 
lay out precisely what it means using pullbacks: 





G 


less than 




between 


1 hour 




and 1 



A=Bxr>C 



a cellphone that 
has a bad battery 



a bad battery 



remains 



charged 



a cellphone 



a battery 



for 


a duration 
of time 


yields 


a range of 
numbers 
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The category-theoretic fact described above says that since A = B Xd C and 
C ^ D xp E, it follows that A = B xp E. That is, we can decuce the definition "a 
cellphone that has a bad battery is defined as a cellphone that has a battery which 
remains charged for less than one hour." In other words, A = B x p E. 

5.3. Products. Given a set of types (boxes) in an olog, one can select one in- 
stance from each. All the ways of doing just that comprise what is called the 
product of these types. For example, if A = '~a number between 1 and 10"' and 
B = '~a. letter between x and z^, the product includes a total of 30 elements, in- 
cluding (4, z). We are ready for the definition. 

Definition 5.3.1. Given sets A,B, their product, denoted Ax B, is the set 

Ax B ^ {{a, b) \ a e A and b e B}. 

There are two obvious projection maps A x B ^ A and A x B ^ B, sending the 
pair (a, b) to a and to b respectively. 

Example 5.3.2. In Example l5.2.H we presented the idea of a piece of furniture 
that was the same width as a space in the house. What if we say that '"a nice 
furniture placement"' is any space that is between 1 and 8 inches bigger than a 
piece of furniture? We can use a combination of products and puUbacks to create 
the appropriate type. 



A=BxdC 

a nice furniture 
placement 



a pair of widths 
(wi , W2 ) such that 
1 < W2 ~ wi < 8 



B=BixB2 

a pair (/, s) 
where / is a piece 
of furniture and s 
is a space in the 
house 




D=DixD-2 



a pair of widths 



a space m 
the house 



has 



£>2 



a width 



Here B and D are products and A is a puUback. This olog lays out what it means 
to be "a nice furniture placement" using products. The bottom horizontal aspect 
_B — >■ Z? is an example of a map obtained by the "universal property of products"; 
see Section [ 
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5.3.3. Products of more (or fewer) types. The product of two sets A and B was 
defined in l5.3.1l One may also take the product of three sets A, B, C in a similar 
way, so the elements are triples (a, b, c) where a £ A, 6 S -B, and c S C. In fact this 
idea holds for any number of sets. It even makes sense to take the product of one 
set (just A) or no sets! The product of one set is itself, and the product of no sets 
is the singleton set {*}. For more on this, see Section [?751 or [Mac]. 

5.4. Declaring an injective aspect. A function is called injective if different 
inputs always yield different outputs. For example the function that doubles every 
integer {x i— ?> 2x) is injective, whereas the function that squares every integer (x t->- 

x^) is not because 3^ = (—3)^. An example of an injective aspect is '"a woman"' -H- 
'~a person"' because different women are always different as people. An example of a 

. . , . , . [- -1 has as father -, , t rr , i 

non-mjective aspect is ' a person ' > ' a person ' because diiierent people 

may have the same father. 

The easiest way to indicate that an aspect is injective is to use a "hook arrow" 
a.s in f: A^ B, instead of a regular arrow f: A B, to denote it. For example, 
the first map is injective (and specified as such with a hook-arrow), but the second 
is not in the olog: 



a person 



has 



a personality 



can be classified 
as being of 



a personality type 



The author of this olog believes that no two people can have precisely the same 
personality (though they may have the same personality type) . 

We include injective aspects in this section because it turns out that injectivity 
can also be specified by puUbacks. See [nLl| for details. 

5.5. Singletons types. A singleton set is a set with one element; it can be consid- 
ered the "empty product." In other words if we denote A" = A x A x ■ ■ ■ A (where 
A is written n times), then A'^ is the empty product and is a singleton set. One can 
specify that a certain type has only one instance by annotating it with A = {*} in 
the olog. For example the olog 



A={*} 


is 


B 




God 




a good thing 











says that the author considers '"God"' to be single. As a more concrete example, the 
intersection of {a- £ M | a; > 0} and {y £ R | a; < 0} is a singleton set, as expressed 
in the olog 



A=BxoC={*} 



a real number z 
such that z > 
and z < 



c 



a real number x 
such that X > 



a real number y 
such that y < 



D 



a real number 
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The fact that A ^ B Xd C and A = {*} are declared indicates that there is only 
one possible instance of a real number that is in both B and C. 

5.6. The universal property of layouts. We cannot do the notion of universal 
properties justice in this paper, but the basic idea is as follows. Suppose that V 
is an olog, that Di,D2 are types in it, and that D = Di x D2 (together with its 
projection maps pi : Z) — > Di and p2'. D ^ D2) is their product. 

(30) 




The so-called universal property of products should be thought of as "an existence 
and uniqueness" claim in V. Namely, for any type X with maps f : X ^ Di and 
g: X — >■ D2, there is exactly one possible map m: X ^ D such that the facts 
/ = m;pi and g = m;p2 hold. 



(31) 




Di X D2 




This may sound esoteric, but consider the following example. 
The following olog is similar to the one in Example 15.3.21 



Bi 



a piece of 
furniture 



has 



Ci 



a width 



IS, in 
inches 



Di 



a number 



/f 

B=BixB2 



a pair (/, s) 
where / is a piece 
of furniture and s 
is a space in the 
house 



D=DixD2 



a pair of num- 
bers (^1,^2) 



4 



a space m 
the house 



has 



a width 



inches 



D2 



a number 



Here the only unlabeled map is the horizontal one B ^ D; how can we get away 
with leaving it unlabeled? How does a piece of furniture and a space in the house 
yield a pair of numbers? The answer is that B has a map to Di (the path across 
the top) and a map to D2 (the path across the bottom), and hence the universal 
property of products gives a unique arrow B ^ D such that the two facts indicated 
by checkmarks hold. (In terms of (PD|) and (|3ip we are using X = B.) In other 
words, there is exactly one way to take a piece of furniture and a space in the house 
and yield a pair of numbers if we enforce that the first number is the width in inches 
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of the piece of furniture and the second number is the width in inches of the space 
in the house. 

At this point we hope it is clear that the universal property of products is a 
useful and constructive one. We will not describe the other universal properties 
(either for puUbacks, singletons, or any colimits); as mentioned above they can be 
found in | Awo| . 

6. More expressive ologs II 

In this section we will describe various colimits, which are in some sense dual to 
limits. Whereas limits allow one to "lay out" a team consisting of many different 
interacting or non-interacting parts, cohmits allow one to "group" different types 
together. For example, whereas the product of '"a number between 1 and 10"' and 

letter between x and z~' has 30 elements (such as (3, y)), the coproduct of these 
two types has 13 elements (including 4). Just as "layout" is a too weak a word to 
capture the essence of limits, "grouping" is too weak a word to capture the essence 
of colimits, but it will have to do. 

Wc will start by describing coproducts or "disjoint unions" in Section [^?T1 Then 
we will describe pushouts in Section 16. 2| wherein one can declare some elements in 
a union to be equivalent to others. There is a category-theoretic duality between 
coproducts and products and between pushouts and puUbacks. It extends to a 
duality between surjections and injections and a duality between empty types and 
singleton types, the subject of Sections 16.31 and 16.41 The interested reader can see 
[Awoj for details. 

6.1. Coproducts. Coproducts are also called "disjoint unions." If A and B are 
sets with no members in common, then the coproduct of A and B is their union. 
However, if they have elements in common, one must include both copies in A 11 S 
and differentiate between them. Here is a definition. 

Definition 6.1.1. Given sets A and B, their coproduct, denoted AUB, is the set 

AUB = {{a, "A") I a e A} U {{b, "S") \ beB}. 

There are two obvious inclusion maps A ^ All B and B — > All B, sending a to 
(a, "A") and b to (6, "S"), respectively. 

If A and B have no elements in common, then the one can drop the "A" and "S" 
labels without changing the set AlIB in a substantial way. Here are two examples 
that should make the coproduct idea clear. 

Example 6.1.2. In the following olog the types A and B are disjoint, so the coproduct 
C = AH B is just the union. 



A 


is 


C=AUB 


is 


B 




a person 




a person or a cat 




a cat 
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Example 6.1.3. In the following olog, A and B are not disjoint, so care must be 
taken to differentiate common elements. 



C=AUB 



an animal 
that can fly 



labeled "A" is 



an animal that can fly 
(labeled "A") or an 
animal that can swim 
(labeled "B") 



B 



labeled "B" is 



an animal that 
can swim 



Since ducks can both swim and fly, each duck is found twice in C, once labeled as 
a flyer and once labeled as a swimmer. The types A and B are kept disjoint in C, 
which justifies the name "disjoint union." 

6.2. Pushouts. Pushouts can express unions in which an overlap is declared. They 
can also express "quotients," where different objects can be declared equivalent. 
Given three objects and two arrows arranged as to the left, the pushout is drawn 
as the commutative square to the right: 



A- 



C 



Given: 



the pushout is drawn: 



B 




We write D = BUj^C and say "D is the pushout of B and C along A." The question 
is, what does it signify? 

The idea is that an instance of the pushout B Ua C is any instance of B or any 
instance of C, but where some instances are considered equivalent to others. That 
is, for any instance of A, its _B-aspect is considered the same as its C-aspect. This 
is formalized in Definition 16.2.21 after being exemplified in Example 16.2.11 

Example 6.2.1. In each example below, the diagram to the right is the pushout of 
the diagram to the left. The new object, D, is the union of B and C, but instances 
of A are equated to their B and C aspects. This will be discussed after the two 
diagrams. 
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(32) 



a cell in the 
shoulder 



c 

a cell in 
the arm 



B 



a cell in the 
torso 



a cell in the 
shoulder 



is 


a cell in 




the arm 



B 



D=BUaC 



a cell in the 
torso 



a cell in the 
torso or arm 



(33) 



a college 
mathematics 



course 



yields 



an utterance 
of the phrase 
"too hard" 



a college 
course 



a college 
mathematics 



course 



yields 



an utterance 
of the phrase 
"too hard" 



a college 
course 



a college course, 
where every 
mathematics 



course is 



replaced by an 
utterance of the 
phrase "too 
hard" 



In Olog (1221), the shoulder is seen as part of the arm and part of the torso. When 
taking the union of these two parts, we do not want to "double-count" the shoulder 
(as would be done in the coproduct B UC, see Example I6.1.3|) . Thus we create a 
new type A for cells in the shoulder, which are considered the same whether viewed 
as cells in the arm or cells in the body. In general, if one wishes to take two things 
and glue them together, the glue serves as A and the two things serve as B and C, 
and the union (or grouping) is the pushout B Ua C. 

In Olog ()33p . if every mathematics course is simply "too hard," then when reading 
off a list of courses, each math course will not be read aloud but simply read as "too 
hard." To form D we begin by taking the union of B and C, and then we consider 
everything in A to be the same whether one looks at it as a course or as the phrase 
"too hard." The math courses are all blurred together as one thing. Thus we see 
that the power to equate different things can be exercised with pushouts. 

Definition 6.2.2. Let A,B, and C be sets and let f : A ^ B and g: A ^ C he 

functions. The pushout oi B ■(/- A C, denoted B Ha C, is the quotient of i? 11 C 
(see Definition 16. l.ip by the equivalence relation generated by declaring b c (i.e. 
b is equivalent to c) if: b £ B,c £ C, and there exists a £ A with /(a) = b and 
g{a) = c. 

6.3. Declaring a surjective aspect. A function f : A ^ B is called surjective 
if every value in B is the image of something in the domain A. For example. 
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the function which subtracts 1 from every integer (x i— !■ a; — 1) is surjective, be- 
cause every integer has a successor; whereas the function that doubles every integer 
{x I— > 2x) is not surjective because odd numbers arc not mapped to. The aspect is 

was published in 



'~a pubhshed paper"' 



'"an estabhshed journaP is surjective because 



every estabhshed journal has had at least one paper published in it. The aspect is 



'"a published paper"' 



has as first author 



> '"a person"' is not surjective because not every 



person is the first author of a published paper. 

The easiest way to indicate that an aspect is surjective is to denote it with a 
"two-headed arrow" as in f : A B. For example, the second map is surjective 
(and indicated with a two-headed arrow) in the olog 



a person 



has 



a personality 



can be classified 
as being of 



a documented 
personality type 



Here the first aspect is not considered surjective, presumably because the author 
imagines personalities had by no person. 

We include surjective aspects in this section because it turns out that surjectivity 
can also be specified by pushouts. See |nL2| for details. 

6.4. Empty types. The empty set is a set with no elements; it can be considered 
the "empty coproduct." In other words if we denote n*A = AlIAYL---]IA (where 
A is written n times), then * ^ is the empty coproduct and is the empty set. 
One can declare a type to be empty by annotating it with ^ = in the olog. For 
example the olog 



A=0 



a supernatural being 



says that the set of supernatural beings is empty. As a more concrete example, the 
intersection of positive numbers and negative numbers is empty, as expressed in the 
olog 



A=BxdC=I/I 



a real number z 
such that z < 
and z > 



a real number x 
such that a; > 



a real number y 
such that y < 



a real number 



6.5. Images. In what remains of Section [HI we will discuss how the ideas of this 
section and the previous (Section [5]) can be used together to create quite expressive 
ologs. First wc will discuss how each aspect f : A B has an "image," the subset of 
B that arc "hit" by /. Then, in Sections 16.61 and l 6 . 1\ we will discuss how ologs can 
express all primitive recursive functions and many other mathematical concepts. 
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Consider the olog 



(34) 



X 



a pair {p, c) where p is a 
person, c is a computer, 
and p owns c 



Y 



a person 



a computer 



Some people own more than one computer, and some computers are owned by more 
than one person. Some computers are not owned by a person, and some people do 
not own a computer. The purpose of this section is to show how to use ologs to 
capture ideas such as "a person who owns a computer" and "a computer that is 
owned by a person". These are called the images of p and c respectively. 

Every aspect has an image, and these are quite important for human under- 
standing. For example the image of the map '"a person"' father^ person"' is 
the type '"a father"'. In other words, a father is defined to be a person x for which 
there is some other person y such that x is the father of y. 

The image of a function / : A — > S is a commutative diagram (fact) 




im(/) 



where fs is surjcctivc and fi is injcctive (see Sections 16.31 and r5.4|) . Wc indicate that 
a type is the image of a map / by annotating it with Im(/), as in the following 
olog: 



a child 



has as parents 



a pair (w,m) where w is 
a woman and m is a man 



C=Im(/) 



a father 



a man 



Hopefully it is also clear that '"a person who owns a computer"' and ^a computer 
that is owned by a person"' are the images oip: X ^ Y and c: X Z (respectively) 
in Olog (jM]) . 

Using the label Ini(/) is the easiest way to indicate an image, although one 
can also do so categorically using limits and colimits. See |Mac| Chapter VIII] for 
details. 
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6.6. Application: Primitive recursion. We have already seen how ologs can be 
used to express a conceptual understanding of a situation (all the ologs thus far ex- 
emplify this idea) . In this section we hope to convince the reader that ologs arc also 
able to express certain computations. In particular we will show by example that 
primitive recursive functions (like factorial or fibonacci) can be expressed by ologs. 
In this way, we can to put computation and knowledge representation together into 
the same framework. It would be quite valuable to strengthen this connection by 
showing that Ologs (or an extension thereof) can express any recursive function 
(i.e. simulate Turing machines). This is an open research possibility. 

Example 6.6.1. In this example we will present an olog that can represent the 
"Factorial function," often denoted n M- n!, where for example the factorial of 4 is 
24. Recall that a natural number is any nonncgative whole number: 0, 1, 2, 3,4,.. .. 



fin) 



s;p = idA s;q^d;f io; f 



OJ 



s: m 



B=AxD 



a positive 
natural number 




a pair {p, q) where p 
is a positive natural 
number and g is a 
natural number 



a natural number 



The idea of this olog is to convey the factorial function as follows. A natural 
number is either zero or positive. Every positive natural number n has a decrement, 
n — 1. The factorial of zero is 1. The factorial of a positive number n is obtained 
by multiplying n by the factorial of n — 1. 

To more explicitly describe the above olog, we must describe its intended in- 
stances. Hopefully the instances of each type (^4 through E) are self-explanatory, 
so we will describe the grouping, the layout, the aspects, and the facts. The set of 
natural numbers is the disjoint union of zero and the set of positive natural numbers 
and the maps io and ii are the inclusions into the coproduct, which explains the 
grouping C ~ A 11 E. The layout B ^ A x D is self-explanatory, and the maps 
p and q are the projections from the product. The map d is the decrement map 
n I— >■ n — 1, the map w sends to 1, the map m is multiplication (n, n') ^-^ n * n' . 
Once TO, d, and w are so-defined, the first two facts [s;p = id^ and s;q = d; f) 
specify that s sends n to the pair (n, /(d(n))), and the second two facts specify 
that / sends to 1 and sends a positive number n to m{s{n)) = m{n, /(c?(n))), i.e. 
n goes to the product n * (n — 1)!. 
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The above olog defines the factorial function (/) in terms of itself, which is the 
hallmark of primitive recursion. Note, however, that this same olog can compute 
many things besides the factorial function. That is, nothing about the olog says that 
the instances of '"Zero"' is the set {0}, that lo sends to 1, that d is the decrement 
function, or that m is multiplication — changing any of these will change / as a 
function. For example, the same olog can be used to compute "triangle numbers" 
(e.g. f(4)=l+2+3+4=10) by simply changing the instances of uj and m in the 
obvious ways (use w = 0,m = + rather than uj = l,m = *)). For a radical 
departure, fix any forest (set of graphical trees) F, let E = '"zero"' represent its set 
of roots, A the other nodes, uj the constant function, d the parent function, and 
m sending (p, d{p)) to f(d{p)) + 1. Then for each tree in F and each node n in that 
tree, the function / will send n to its height on the tree. 

Primitive recursion is a powerful technique for deriving new functions from the 
repetition of others using a kind of "while loop." The general form of primitive 
recursive functions can be found in [BB J| . and it is not hard to imitate Example 
16.6. II for the general case. 

6.7. Application: defining mathematical concepts. In this subsection we 
hope to convince the reader that many mathematical concepts can be defined by 
ologs. This should not seem like much of a stretch: ologs describe relationships 
between sets, so we rely on the maxim that all of mathematics can be formulated 
within set theory. To make the idea explicit, however, we will recall the definition 
of pseudo-metric space (in I6.7.ip and then provide an olog with the same content 
(in ESI). 

Definition 6.7.1. Let R>o denote the set of non-negative real numbers. A pseudo- 
metric space is a pair {X, S) where A is a set and (5 : A x A — > M>o is a function 
with the following properties for all elements x,y, z G A: 

(1) S{x,x)=0; 

(2) 5{x,y) = S{y,x); and 

(3) 5{x,z) < 6{x,y) + 5{y,z). 
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(35) 



dp; y ^ di;y d2; z = do;z di\z = dp; z 
s;y — idj s; z = id-T s;S=g;0 5 ](f>=5 



\z = y 



d2;S ^f;a do;5 =f;b di;6 ^ f;c 



1=5x75x75 



a triple {xo,Xi,X2) of points in X 



do 



3=5x75 



a triple (a, b, c) of non- 
negative real numbers such 
that c < a + b 



a b c 



a pair {y, z) of points in X 



a non-negative real number 





6={*} 



a point in X 



{0} 



X 



a pseudo-metric space 



As long as the instances for the right-hand side of this olog are mathematically 
correct (i.e. we assign 4 the set of non- negative real numbers), this olog has the 
same content as Definition 16.7.11 One can use ologs to define usual metric spaces 
(in which Property (1) in Definition l6.7.1l is strengthened), but it would have taken 
too much space here. 

It should be clear that ologs provide a more precise and explicit description of 
any concept, relying less on the grammar of English and more on the mathematical 
"grammar" of sets and functions. Assumptions are exposed as all the working parts 
of an object need to be explicitly documented. Thus an olog is likely to be instantly 
readable by a theorem prover such as Coq ( |Coq| ), at least if one creates the olog 
within an appropriate Olog-Coq interface API. Moreover, various parts of this olog 
may be reusable in other contexts, and hence connect pseudo-metric spaces into a 
web of neighboring definitions and theorems. 

In fact, once a corpus of mathematics has been written in olog form, evidence of 
conjectures not yet proven could be written down as instance data. For example, 
one could record every known prime as instances of a type '"prime^ and a machine 
could automatically check that Goldbach's conjecture (written as an olog containing 
•"prime^ as a type) holds for all example "so far." With definitions, theorems, and 
examples all written in the same computer-readable language of ologs, one may 
hope for much more advanced searching and knowledge retrieval by humans. For 
example, one could formulate very precise questions as database queries and use 
SQL on the database corresponding to a given olog (see Section [ 
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7. Further directions 

Ologs are basically categories which have text labels to explain their intended 
semantic. As such there are many directions to explore ranging from quite theoret- 
ical to quite practical. Here we consider three main classes: extending the theory 
of ologs, studying communication with ologs, and implementing ologs in the real 
world. 

7.1. Extending the theory of ologs. In this paper we began by discussing basic 
ologs, which are rich enough to capture the semantic of many situations. In Sections 
[5] and [H] we added more expressivity to ologs to allow one to encode ideas such as 
intersections, unions, and images. However, ologs could be even more expressive. 
One could add "function types" (also known as exponentials); add a "subobject 
classifier type," which could allow for negation and complements as well as power- 
sets; or even add fixed sets (like the set of Strings) to the language of ologs. This 
is not too hard (using sketches, see [Mak| ) : the reason we did not include them in 
this paper was more because of space than any other reason. 

Another generalization would be to allow the instances of an olog to take values 
in a category other than Set. For example, one could have an instance-space 
rather than an instance-set, e.g. it is clear that the instances of the type '"a point 
on the unit circle"' constitute a topological space. One could similarly argue that 
the instances of the type '"a human invention"' have a topology or metric as well 
(e.g. as an invention, the cellphone is closer to the telephone than it is to artificial 
flavoring). Instance data on an olog C corresponds to a functor C — > Set in this 
paper, but it is quite easy to replace Set with a different category such as Top 
(the category of topological spaces), and this may have interesting uses in data 
modeling. 

In Section 16.71 we explicitly showed that pseudo-metric spaces (and we stated 
further that metric spaces) can be presented by ologs. It would be interesting to 
see if theorems could also be proven entirely within the context of ologs. If so, a 
teacher could first sketch a mathematical proof as a small or sparse olog C, and 
then use a functor C — > P to rigorously "zoom in" on that proof so that the sketch 
becomes a full-fiedged proof (as the maps in C are factored into understandable 
units in V). 

If ologs are to be viable venues in which to discuss results in mathematics, then 
they should be capable of describing all recursion, not just primitive recursion (as 
in Section [6. 6p . We do not yet have an understanding for how this can be done. If 
recursion can be fully defined with the ologs described above, it would be interesting 
to see it written out; if not, it would be interesting to understand what basic idea 
could be gracefully added to ologs so that recursion becomes expressible. 

In a different direction, one could test the expressive power of ologs by defining 
simple games, like Tic Tac Toe or Chess, using ologs. It would be impressive to 
define a vocabulary for writing games and a program which could automatically 
convert an olog-defined game into a playable computer game. This would show that 
the same theory that we have seen express ideas about fatherhood and factorials 
can also be used to invent games and program computers. 

7.2. Studying communication with ologs. As discussed in Section^ ologs can 
be connected by functors into networks that are not just 2-way, but 7i-way. These 
communication networks should be studied: what kinds of information can pass. 
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how reliable is it, how quickly can it spread, etc. This may be applicable in fields 
from economics to psychology to sociology. Such research may use results from 
established mathematics such as Network Coding Theory (see |YLC| ). 

In [SA| , we study how two or more entities (described as ologs) can communicate 
new ideas (not just new instance data) to each other. It would be interesting to 
see how well this "communication protocol" works in practice, and whether it can 
be theoretically automated. Furthermore, this communication protocol and any 
theoretical automation of it should be implemented on a computer to see if different 
database schemas can be meaningfully integrated with minimal human assistance. 

It may be possible to train children to create ologs about their interests or about 
a given lesson. These ologs would show how the child actually perceives something, 
which would probably be fascinating. By our experience and that of people we have 
taught, the process of building an olog usually leads to a clarification of the concepts 
involved. Moreover, a class project to connect the ologs of different students and 
between the students and the teacher, may have excellent pedagogical benefits. 

Finally, it may be interesting to study "local truth" vs. "global truth" in a 
network of ologs. Functorial connections between ologs can allow for translation of 
ideas between members of a group, but there may be ideas which do not extend 
globally, just as a Mobius band does not admit a global orientation. That is, given 
three parties on the Mobius band, any pair can agree on a compass orientation, but 
there is no choice that the three can simultaneously agree on. Similarly, whether 
or not it is possible to construct a global language which extends all the existing 
local ones could be determined if these local languages and their connections were 
entered into a computer olog system. 

7.3. Implementing ologs in the real world. Once ologs are implemented on 
computers, and once people learn how to author good ologs, much is possible. One 
advantage comes in searching the information space. Currently when we search for 
a concept (say in Google or on our hard drive), we can only describe the concept in 
words and hope that those words are found in a document describing the concept. 
That is, search is always text-based. Better would be if the concept is meaning- 
fully interconnected in a web of concepts (an olog) that could be navigated in a 
meaningful (as opposed to text-based) way. 

Indeed, this is the semantic web vision: When internet data is machine-readable, 
search becomes much more powerful. Currently, we rely on RDF scrapers that scour 
web pages for (subject, predicate, object) sentences and store them in RDF format, 
as though each such sentence is a fact. Since people are inputting their data as 
prose text, this may be the best available method for now; however, it is quite 
inaccurate (e.g. often 15% of the facts are wrong, a number which can lead to 
degeneration of deductive reasoning - see [MBCHj ). If ideas could be put on the 
internet such that they compatibly made sense to both human and computer, it 
would give a huge boost to the semantic web. We believe that ologs can serve as 
such a human-computer interface. 

While it is often assumed that because we all speak the same language we all 
must mean the same things by it, this is simply not true. The age-old question 
about whether "blue for me" is the same as "blue for you" is applicable to every 
single word and idiom in our language. There is no easy way to sync up different 
people's perceptions. If communication is to be efficient, agreements must be fairly 
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explicit and precise, and this precision demands a rigor that is simply unavailable 
in English prose. It is available in a network of ologs (as described in Section |4]). 

For example, the laws of the United States are hopelessly complex. Residents of 
the US are required to obey the laws. However, unlike the rules of the Scholastic 
Aptitude Test (SAT), which take 10 minutes for the proctor to read aloud, the 
laws of the US are never really expressed — the most important among them are 
hopefully picked up by cultural osmosis. If an olog was created which had enough 
detail that laws could be written in that format, then a woman could research for 
herself whether her landlord was required to fix her refrigerator or whether this 
was her responsibility. It may prove that the olog of laws is internally inconsistent, 
i.e. that it is impossible for a person to satisfy all the laws — such an analysis, if 
performed, could fundamentally change our outlook on the legal system. 

The same goes for science; information written up in articles is much less acces- 
sible than information that is entered into an ontology. However, the dream of a 
single universal ontology is untenable ( [Minj ) . Instead we must allow each lab or 
institute to create its own ontology, and then require citations to be functorial olog 
connections, rather than mere silo-to-silo pointers. Thus, a network of ologs should 
be created to represent the understanding of the modern scientific community as a 
multi-faceted whole. 

Another impetus for a scientist to write an olog about the study at hand is that, 
once an olog is made, it can be instantly converted to a database schema which the 
scientist can use to input all the data pertaining to this study. Indeed, if some data 
did not fit within this schema, then the olog must have been insufficient to begin 
with and should be modified to fully describe the experiment. If scientists work this 
way, then the separation between them and database modelers can be reduced or 
eliminated (the scientist assumes the database modeling role with little additional 
burden). Moreover, if functorial connections are established between the ologs of 
different labs, then data can be meaningfully shared along those connections, and 
ideas written in the language of one lab's olog can be translated automatically into 
the language of the other's. The speed and accuracy of scientific research should 
improve. 
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